MULTIPLE RECURRENCE AND CONVERGENCE FOR HARDY 
SEQUENCES OF POLYNOMIAL GROWTH 
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Abstract. We study the limiting behavior of multiple ergodic averages involving sequences 
of integers that satisfy some regularity conditions and have polynomial growth. We show 
that for "typical" choices of Hardy field functions a(t) with polynomial growth, the averages 
it Y2n=i fi{T^ a ^ n ^x) • . . . ■ fi(T l ^- a( -"^x) converge in the mean and we determine their limit. For 
example, this is the case if a(t) = t 3 ^ 2 , t log t, or t 2 + (log t) 2 . Furthermore, if {a\(t), . . . , ae(t)} 
is a "typical" family of logarithmico-exponential functions of polynomial growth, then for every 
ergodic system, the averages ~ J^n=i fi (T [<ll(n)] a;) • . . . • ft (T [<lf (n)] :r) converge in the mean to 
the product of the integrals of the corresponding functions. For example, this is the case if 
the functions cii(t) are given by different positive fractional powers of t. We deduce several 
results in combinatorics. We show that if a(t) is a non-polynomial Hardy field function with 
polynomial growth, then every set of integers with positive upper density contains arithmetic 
progressions of the form {m, m + [a(n)], . . . ,m + £[a(n)]}. Under suitable assumptions we get 
a related result concerning patterns of the form {m, m + [ai(n)], . . . ,m+ [at (n)]}. 
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1. Introduction 

In recent years there has been a lot of activity in studying the limiting behavior in 1? (/i) (as 
iV —7- oo) of multiple ergodic averages of the form 

1 N 

(1) _^h(T ai{n) x)-...-MT a ^x) 

n=l 

for various choices of sequences of integers a\ (n) , . . . , ag (n) , where T is an invertible measure 
preserving transformation acting on a probability space (X, X,/j,), and fi, ■ ■ ■ ,ft are bounded 
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measurable functions. This study was initiated in |18j . where Furstenberg studied the aver- 
ages dl]) when ai(n) = n, 02(71) = 2n, . . . , a^(n) = in, in a depth that was sufficient to give 
a new proof of Szemeredi's theorem on arithmetic progressions (|32j). Later on, Bergelson 
and Leibman in |9] extended Furstenberg's method to cover the case where the sequences 
ai(n), . . . , ae(n) are integer polynomials with zero constant term, and established a polynomial 
extension of Szemeredi's theorem. 

For 1=1 the limiting behavior in L 2 (u) of the averages (pQ) can be reduced (using the 
spectral theorem for unitary operators) to the study of certain exponential sums, and therefore 
is in a sense well understood. For £ > 2, even in the simplest cases, convergence of the averages 
(PQ) and identification of the limit turned out to be a very resistent problem. Nevertheless, 
we now have several different proofs of convergence in the case where the sequences are linear 
(|26j. [38] . [33] . [35], pQ, [25]), and the case where all the sequences are integer polynomials was 
treated in [27] and [29]. Furthermore, a rather explicit formula for the limit of the averages 
dH) can be given in the linear case (combining results from [26] and [37]), and for some special 
collections of integer polynomial sequences ([IS], p3], [30]). 

The purpose of this article is to carry out a detailed study of the limiting behavior of the 
averages ([I]) for a large class of sequences of integers ai(n), . . . , o^(n) that have polynomial 
growth (meaning ai(t)/t k — > for some k G N) but are not necessarily defined by integer 
polynomials^! For example, we shall show that for every positive c G R \ Z, and measure 
preserving transformation T, the averages 

1 N 

(2) - fi(T [nC] x) ■ f 2 (T 2 ^x) • . . . • h(T l[nC] x) 

n=l 

converge in L 2 (u), and their limit is equal to the limit of the "Furstenberg averages" 

1 N 

(3) - h^*) ■ h(T 2n x) • • • • • h(T en x). 

n=l 

More generally, the role of the sequence [n c ] in ([2]) can play any sequence [a(n)] where a(t) is 
a function that belongs to some Hardy field, has polynomial growth, and stays logarithmically 
away from constant multiples of integer polynomials (see Theorem \2.2\i . For instance, any of 
the following sequences will work (below, k is an arbitrary positive integer) 

(4) [nlogn], [n 3 /logn], [n 2 + nlogn], [n 2 + V3n], [n 2 + (log n) 2 ], [(log(n!)) fe ], [(Li(n)) fc ]. 

We also give explicit necessary and sufficient conditions for mean convergence of the averages 
([2]) when the sequence [n c ] is replaced with the sequence [a(n)] where a(t) is any function that 
belongs to some Hardy field and has polynomial growth. 

With the help of the previous convergence results we derive a refinement of Szemeredi's 
theorem on arithmetic progressions. We show that if a(t) is a function that belongs to some 
Hardy field, has polynomial growth, and is not equal to a constant multiple of an integer 
polynomial (modulo a function that converges to a constant), then for every t 6 N, every set 
of integers with positive upper densitjfl contains arithmetic progressions of the form 

{m, m + [a(n)], . . . , m + £[a(n)]} 



The case where the transformation T is a nilrotation was treated in the companion paper [14] and is an 
essential component of the present paper. 

2 A set of integers A has positive upper density if d(A) = limsup^^^ |A n {— N, . . . , N}\/(2N + 1) > 0. 
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(see Theorem I2.5p , Therefore, one can use any of the sequences in ^) in place of [ct(re)] and also 
sequences like [n 2 + logn] or [\/2^ 2 + loglogn] (these sequences are bad for mean convergence). 

Furthermore, we study the averages ([1]) for sequences that are not necessarily in arithmetic 
progression. We show that if c\, . . . , eg € R\Z are positive and distinct, then for every ergodic 
transformation T we have 

N 

(5) Jim 1 £ h(T^x) • . . . • h(T^x) /'/, <l, /'/, ,1,, 

n=l J J 

where the convergence takes place in L 2 (//,). More generally, one can replace the sequences 
[n Cl ], . . . , [n Ci ] in (JSJ) with sequences [ai(n)], . . . , [^(n)] where the functions a\(t), . . . , ae(t) are 
logarithmico-exponential and satisfy some appropriate growth conditions (see Theorem 12 .6|) . 
This enables us to establish a conjecture of Bergelson and Haland-Knutson ([7], Conjecture 8.2). 
We deduce that if c\, . . . , C£ € R\Z are positive, then every set of integers with positive upper 
density contains patterns of the form 

{m, m + [n Cl ], ... ,m+ [n Ce ]}. 

In the next section we give a more precise formulation of our main results and also define 
some of the concepts used throughout the paper. 

2. Main results 

We first introduce some basic terminology needed to state our main results. The reader will 
find more information related to the notions involved in the background section. 

All along the article we will use the term measure preserving system, or the word system, 
to designate a quadruple (X, X,/j,,T), where (X,X,fi) is a Lebesgue probability space, and 
T : X — > X is an invertible measurable map such that niT^A) = fi(A) for every A £ X. 

Let B be the collection of equivalence classes of real valued functions defined on some half- 
line (c, oo), where we identify two functions if they agree eventually!! A Hardy field is a subfield 
of the ring (B, +, •) that is closed under differentiation. With T~L we denote the union of all 
Hardy fields. If a G H is defined in [l,oo) (one can always choose such a representative of 
a(t)) we call the sequence ([a(n)]) ng N a Hardy sequence. Working within the class H eliminates 
several technicalities that would otherwise obscure the transparency of our results and the main 
ideas of their proofs. Furthermore, H is a rich enough class to enable one to deal, for example, 
with all the sequences considered in (jj]). 

An explicit example of a Hardy field to keep in mind is the set ££ that consists of all 
logarithmico-exponential functions ([23], [23]), meaning all functions defined on some half- 
line (c, oo) using a finite combination of the symbols +, — , x, :,log,exp, operating on the real 

variable t and on real constants. For example, all rational functions and the functions t^, 

tlogt, e Vloglogt/ log ( t 2 + ^ belong in ££. 

The set H is much more extensive than the set ££, for example, one can show that it 
contains all antiderivatives of elements of ££, the Riemann zeta function £, and the Euler 
Gamma function V. Let us stress though that H does not contain functions that oscillate like 
sini or ts'mt, or functions that have a derivative that oscillates, like t 100 + sint. 



The equivalence classes just defined are often called germs of functions. We are going to use the word 
function when we refer to elements of B instead, with the understanding that all the operations defined and 
statements made for elements of B are considered only for sufficiently large values of t G R. 
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The most important property of elements of rl that will be used throughout this article is 
that we can relate their growth rates with the growth rates of their derivatives. 

To simplify our exposition we introduce some notation. If a(t), b(t) are real valued functions 
defined on some half-line (u, oo) we write a(t) -< b(t) if a{t)/b{t) — > as t — > oo. (For example, 
1 -< logt -< t £ for every e > 0.) We write a(t) <C b{i) if there exists C G M. such that 
\ a (t)\ < C|6(i) | for all large enough t£K. We say that a function a(t) has polynomial growth 
if a(t) <C t k for some k G N. 

2.1. Arithmetic progressions. We are going to give a rather exhaustive collection of results 
that deal with multiple convergence and recurrence properties of Hardy sequences ([a(n)]) n( =N 
of polynomial growth. 

2.1.1. Convergence. Let a(t) be a real valued function. We say that the sequence of integers 
([ a ( n )])n<=N is good for multiple convergence if for every i G N, system (X, X , fj,, T), and functions 
fi, h, •••,/£ G £°°0), the averages 

1 N 

(6) - fi(T Hn)] x) • f 2 {T 2 ^x) • . . . • /.(T^^x) 

n=l 

converge in L 2 ([i) as iV — > oo. As mentioned in the introduction, any polynomial with integer 
coefficients is an example of such a sequence. 

The next result gives an extensive list of new examples of sequences that are good for 
multiple convergence. In fact it shows that it is a rather rare occurrence for a Hardy sequence 
with polynomial growth to be bad for multiple convergence. 

Theorem 2.1. Let a € % have polynomial growth. 

Then the sequence ([a(n)]) n£ N is good for multiple convergence if and only if one of the 
following conditions holds: 

• \a(t) — cp{t)\ >~ \ogt for every c£K and every p £ Z[t]; or 

• a(t) — cp(t) — > d for some c,d £l and some p G Z[i]; or 

• \a(t) — t/m\ ^ logi for some m G Z. 

Remarks. • It follows that the sequences in and the sequences Win 2 ], [n/2 + log n] are 
good for multiple convergence. The sequences [sfbn 2 + logn], [2n + logn] are bad for multiple 
convergence. 

• The same necessary and sufficient conditions for convergence of "single" ergodic averages 
~k J2n=i f(T^ n ^x) where previously established in [12] . 

• If a(t) is a real valued polynomial, then our argument shows that the averages ([6]) converge 
in L 2 (n) even if one replaces the limit limjv-5.00 77 Yln=i w ^ n the limit limTv-M^oo n-m Yl,n=M+ 
On the other hand, if a G % satisfies t k ^ 1 -< a(t) -< t k for some k G N, then one can show that 
the sequence ([a(n)]) n£ N takes odd (respectively even) values in arbitrarily long intervals; as a 
result the limit lim^r-Af^oo n-ai Yln=M+i T^ n ^ f does not exist in general. 

Notice that the first condition of Theorem 12. II is satisfied by the "typical" function in % with 
polynomial growth. The next result allows us to identify the limit of the averages © for such 
"typical" functions: 

Theorem 2.2. Let a G % have polynomial growth and satisfy \a{t) — cp{t)\ y logi for every 
c£R and every p G 7L\t\ . 
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Then for every £ G N, system (X, X, fj,, T), and functions fx, ■ ■ ■ , ft G L°°(p,), we have 

N N 

( ? ) J im m E T[a(n)1 h ■ T 2[a{n)] h ■ ■ ■ ■ ■ ^ [a(n)1 f e = lim - J] T™A • T 2m / 2 • . . . • T ln f t 

n=l n=l 

where the limit is taken in L 2 (fi). 

Remarks. • Examples of Hardy sequences for which this result applies are those given in @ . 

• A rather explicit formula for the limit in (J7J) can be given by combining results in [26] (see 
also [38]) and [37] . 

• If a(t) = cp(t) + d for some c£l and p G Z[t], then (J7J) typically fails. One can see this 
by considering appropriate rotations on the circle and taking a(t) = 2t,t 2 , or y/2t. 

2.1.2. Recurrence. Let a(t) be a real valued function. We say that the sequence of integers 
([ a ( n )])nGN is good for multiple recurrence if for every £ G N, system (X, X,/i,T), and set 
A G X with n(A) > we have 

(8) fi(A n T~^A n T~ 2 ^ n ^A n • • • n r^^U) > o 

for some n S N such that [o(n)] 7^ 0. One can check that if the sequence ([a(n)]) ng N is good 
for multiple recurrence, then ([8]) is satisfied for infinitely many n G N. 

Let us discuss briefly the recurrence properties of sequences defined using polynomials with 
real coefficients. If q G R[t] is non-constant and has zero constant term, then the sequence q(n) 
is good for multiple recurrence (this follows from [9] and a trick used in [6]). If q G Z[i] does not 
have zero constant term, then the sequence q(n) is good for multiple recurrence if and only if 
the range of the polynomial contains multiples of every positive integer f|13]). More generally, 
if q G R[t], then [q[n)\ is good for multiple recurrence unless q(t) has the form q(t) = cp(t) + d 
for some p G Z[t] and c,d £ 1 (one way to see this is to use Theorem 12.31 below). In this 
last case deciding whether the sequence [q(n)] is good for multiple recurrence is more delicate 
and depends on intrinsic properties of the polynomial q. For example, one can show that the 
sequences [y/En + 1] and [\/5n + 3] are good for multiple recurrence, but the sequence 
is bad for multiple recurrence^ 

Our next result shows that if one avoids polynomial sequences, then every Hardy sequence 
of polynomial growth is good for multiple recurrence: 

Theorem 2.3. Let a G T~L have polynomial growth and suppose that a{t) — cp(t) —> 00 for every 
c G R andp£ Z[t]. 

Then the sequence ([a(n)]) ng ^ good for multiple recurrence. 

Remarks. • Examples of Hardy sequences for which this result applies are given in @. It also 
applies to the sequences [V5n + log n] and [n 2 + log log n] . 

• Theorem 12. 31 was previously established in [17] under a somewhat more restrictive assump- 
tion (namely t k ~ 1 -< a(t) -< t k for some k G N). Furthermore, the single recurrence case was 
previously established by Boshernitzan (unpublished), and subsequently in [17] . 

• Let R be the set of those n G N for which ([8]) holds. Combining the multiple recurrence 
result of Furstenberg ([18]) and Theorem 12. 2\ one sees that if a(t) — cp(t) >- logt for every 

The sequence [yEn + 2] is bad for recurrence because \\[\/En + 2]/VE\\ > 1/10 for every n 6 N, where ||-|| 
denotes the distance to the closest integer. It can be shown ([6]) that the sequence [an + b], a, b £ R, is good for 
single recurrence (meaning |[8]| holds for I = 1) if and only if there exists an integer k such that ak + b £ [0, 1] 
(this is equivalent to {b/a} < l/a). For other sequences of the form [ap(n) + n], like [an 2 + b], necessary and 
sufficient conditions seem to be more complicated. 
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c 6 K and p G Z[i], then the set R has positive lower density. Unlike the case where a(t) is 
polynomial, if a G % satisfies t*^ 1 -< a(t) -< t fe for some k G N, then one can show that the 
sequence [a(n)] takes odd values in arbitrarily long intervals, and as a result for some systems 
the set R has unbounded gaps. 

2.1.3. Characteristic factors. Let (X, X , fi, T) be a system. A factor C is called a characteristic 
factor, or characteristic, for the family of integer sequences {ai(n), . . . , a^(n)}, if whenever one 
of the functions f\, . . . ,fg G L°°(^) is orthogonal to C, the averages 

1 W 

(9) _Y,fi{T ai{n) x)-...-h{T a ^x) 

n=l 

converge to in L 2 (fi) as ./V — > oo. 

It follows that if C is as above, then the limiting behavior of the averages Q remains 
unchanged if one projects each function to the factor C, meaning that the difference of the two 
averages converges to in L 2 (/j,) as iV — > oo. 

It is known that the nilfactor Z of a system (defined in Section 13. 3p is characteristic for 
every family {p(n),2p(n), . . . ,£p(n)} whenever p is an integer polynomial (|27j). We extend 
this result by showing: 

Theorem 2.4. Suppose that a G % has polynomial growth and satisfies a(t) >~ logt. 

Then for every system and I G N, the nilfactor Z of the system is characteristic for the 
family {[a(n)], 2[a(n)], . . . ,i[a(n)]}, for every i G N. 

Remarks. • If a(t) <C logt, then the result fails even for 1 = 1, the reason being that the 
sequence [a(n)] remains constant on some sub-interval of [l,iV] that has length proportional to 
N as N — > oo. Therefore, if the transformation T is weakly mixing but not strongly mixing, the 
function / has zero integral and satisfies / f-T n f d\i ■/* 0, then f±Z but j, *£n=i T^Wf A 0. 

• A related result was proved in [7] for weakly mixing systems assuming that the function 
a(t) is tempered (for a G H this is equivalent to t k ~ 1 logt -< a(t) -< t k for some k G N). Since 
the method used in [7] does not work for functions like t k log t, we will use a different approach 
to prove Theorem 12.41 

2.1.4. Combinatorics. Using the previous multiple recurrence result we derive a refinement of 
Szemeredi's Theorem on arithmetic progressions. We will use the following correspondence 
principle of Furstenberg (the formulation given is from [3]): 

Furstenberg Correspondence Principle. Let A be a set of integers. 

Then there exist a system (X,B, n,T) and a set A G X, with n(A) = d(A), and such that 

(10) d(A n (A — m) n . . . n (A — n e )) > n(A n r n M n • • • n T~ ne A) 

for every 1%%, . . . , ri£ G Z and £ G N. 

Using the previous principle and Theorem 12.31 we immediately deduce the following: 

Theorem 2.5. Let a G % have polynomial growth and suppose that a(t) — cp{t) — > oo for every 
c G K and every p G Z[t] . 

Then for every I G N, every A C Z with d(A) > contains arithmetic progressions of the 
form 

(11) {m, m + [a(n)],m + 2[a(n)], . . . , m + ^[a(n)]} 
for some m G Z and n G N wii/i [a(n)] ^ 0. 
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2.1.5. More general classes of functions. We make some remarks about the extend of the func- 
tions our methods cover that do not necessarily belong to some Hardy field. 

The conclusions of Theorems 12.21 12.31 12.41 and 12.51 hold if for some k G N the function 
a G C k+1 (R+) satisfies 

|a (fe+1) (t)| decreases to zero, l/t k ~< a^(t) -< 1, and (a (fc+1) (t)) fe -< (a^(t)) k+1 . 

(If a G rl, these three conditions are equivalent to a a(t) has polynomial growth and \a(t) — 
p(t)\ y logt for every p G M[i]".) One can see this by repeating verbatim the proofs given in this 
article and in [14]. The reader is advised to think of the second condition as the most important 
one and the other two as technical necessities (for functions in % the second condition implies 
the other two). 

As for Theorem 12. 1\ unless one works within a "regular" class of functions like it seems 
impossible to get explicit necessary and sufficient conditions. 

2.2. Several sequences. We are going to give results related to multiple convergence and 
recurrence properties involving several sequences of polynomial growth. For practical reasons 
(mainly expository) we are going to restrict ourselves to the case where all the functions involved 
are logarithmico-exponential. More technically involved arguments should enable one to extend 
the results mentioned below to the case where all the functions belong to the same Hardy field. 

Let us also remark that the results we give below are certainly less exhaustive than the results 
of Section 12.11 We are able to handle a case that includes all functions given by fractional 
powers of t and is general enough to cover a conjecture of Bergelson and Haland. The expected 
"optimal" results involving several sequences are stated in Problems 2, 3, and 4 of Section \2. 31 

2.2.1. Convergence. To simplify our statements we introduce the following class of "good" (for 
our purposes) functions: 

(12) g = {a G C{R+) : t k+£ -< a(t) -< t k+1 for some integer k > and some e > 0}. 

Equivalently, a function a G rl with polynomial growth belongs in Q unless for some integer 
k > we have t k -< a(t) -< t k+£ for every e > 0. For example, if c > 0, then t c G Q if and only if 
c is not an integer. The reader is advised to think of functions in Q as having "fractional-power 
growth rate" . 

The next result (in fact its corollary Theorem 12. 8p verifies a conjecture of Bergelson and 
Haland-Knutson ([7], Conjecture 8.2) O It shows that for "typical" logarithmico-exponential 
functions of polynomial growth the limit of the averages ([1]) exists and for ergodic systems it is 
constant. We say that the functions a\(t), . . . ,ai(t) have different growth rates if the quotient 
of any two of these functions converges to ±oo or to 0. 

Theorem 2.6. Suppose that the functions a±, . . . G C£ n Q have different growth rates. 
Then for every ergodic system (X, B,fj,,T) and f±, . . . , fg G L°° (fi) we have 

N 

( 13 ) J im 7j E /i(^ [ai(n) M • • • • • fi(T [ae{n)] x) = h dy. [ft dp 

n=l J J 

where the convergence takes place in L 2 (n). 



comment about notation. In [7] the class £.£ is denoted by H. Also, for functions in ££ our class Q 
coincides with the class T defined in [7]. 
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Remarks. • Some examples for which our result applies are given by the collections of sequences 
{[n 1 / 2 ], [n 3 / 2 ], [re 5 / 2 ]}, and {[n^], [n^ 2 log log re] , [n^logn]}. 

• Equation (|13p fails for some ergodic systems if a non-trivial linear combination of the 
functions a\(t), . . . , ae(t) is an integer polynomial other than ±t + k. 

• A substantial part of the proof (carried out in the companion paper [T3]) is consumed in 
working on a potentially non-trivial (characteristic) factor of our system. Initially we show 
that this factor has (roughly speaking) the structure of a nilsystem, only to realize later (using 
some non-trivial equidistribution results on nilmanifolds) that this factor is trivial. It would 
be nice to have a proof that avoids such diversions to non-Abelian analysis. 

If all the functions di(t), . . . ,ai(t) have sub- linear growth then Theorem 12.61 can be (rather 
easily) proved in a more general setup, where one uses iterates of I not necessarily commuting 
ergodic transformations in place of a single ergodic transformation. 

Theorem 2.7. Let a%, . . . , ai E C£ HQ have different growth rates and satisfy at(t) -< t for i = 
1, . . . ,£. Let Ti, . . . , Ti be invertible measure preserving transformations acting on a probability 
space (X, X, fi). 

Then for every fi, ■ ■ ■ , fi £ L°°{fj) we have 

i N 

J im Tr £ Zi(^ ai(n)1 *) • • • • • h{Tf l[n)] x) = f l -...-f i , 

n=l 

where fi = E(/j|X(Tj)), and the convergence takes place in L 2 (/j,). 

Remarks. • In [7J a similar result was proved for iterates of a single transformation. 

• It is not known whether similar convergence results hold without any commutativity as- 
sumption on the transformations Tj for some choice of functions ai(t) with different, at least 
linear growth rates. On the other hand, it is known (|2j) that for some choice of non-commuting 
transformations T\ , T2 and functions f\ , fa , the averages 

£n=i h( T i x ) ■ hi T 2 x ) diverge in 

L 2 (/z). 

2.2.2. Recurrence. The next multiple recurrence result is a consequence of Theorem 12.61 

Theorem 2.8. Suppose that the functions ai,...,ag £ C£ n Q have different growth rates. 
Then for every system (X, X, fi,T) and set A S X we have 

1 N 

(14) lim - Vwin r^WUn r- [aa(n) U n • • • n r~ Mn) U) > GuU)) m - 

71=1 

Remarks. • The estimate (I14p becomes an equality when the system is ergodic. 

• The lower bounds (114ft contrast the corresponding lower bounds when the functions 
di(i), aiit) are non-constant integer polynomials. In this case, (I14h fails even when £ = 1 
and ai(t) = t 2 . In fact no power type lower bound is known for any collection of polynomials 
(except of course when all the functions are equal and linear). 

2.2.3. Characteristic factors. The next result gives convenient characteristic factors for a family 
of "typical" logarithmico-exponential sequences of polynomial growth. 

Theorem 2.9. Let a%, . . . ,0^ G C£, and suppose that all the functions a,(t) and their pairwise 
differences fflj(t) — o,j{t) belong in Q (defined (|12p ). 

Then for every system its nilf actor Z is characteristic for the family {[ai(n)], . . . , [a^(n)]}. 
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Remark. A related result was proved in [7] for weakly mixing systems. In fact we are going to 
adapt the argument used in j7j to establish our result. 

2.2.4. Combinatorics. Using Furstenberg's Correspondence Principle and Theorem 12.81 we im- 
mediately deduce the following: 

Theorem 2.10. Suppose that the functions ai,...,ag £ ££ D Q have different growth rates. 
Then for every set of integers A we have 



liminf 1 £ d(A n (A - [ai(n)]) n • • • n (A - [a e (n)])) > (d(A)) £+1 . 



n=l 

2.3. Further directions. We state some open problems that are closely related to the results 
stated before. To avoid repetition we remark that in Problems 1-4 we always work with a 
family T = {ai(t), . . . ,ai(t)} of functions of polynomial growth that belong to the same Hardy 
field. With span(J r ) we denote the set of all non-trivial linear combinations of elements of J-. 

2.3.1. Convergence. The family of functions J- = {a±(t), . . . , ae(t)} is good for multiple conver- 
gence if for every system (A, X, fj,, T) and functions fi, ■ ■ ■ , ft £ -L°°( / u) the limit 



(15) ^im 1 fi(T^x) MT^x) 

n=l 

exists in L 2 ([i). 

The next problem is much in the spirit of Theorem 12.11 

Problem 1 . The family T is good for multiple convergence if and only if every function a £ 
span(J-) satisfies one of the following conditions: 

• \a(t) — cp{t)\ y- logi for every c £ M and every p £ %[t]; or 

• a(t) — cp(t) — > d for some c, d G M; or 

• \a(t) — t/m\ <C log t for some m G Z. 



The next problem provides a possible generalization of Theorem 

Problem 2. Suppose that every function a £ span(T) satisfies \a{t) — cp{t)\ >- logi for every 
c £ M and every p £ Z[t] . 

Then for every ergodic system (A, £>, fi, T) and fi, - ■ ■ , ft £ we have 



N 

ft d/J, 

n=l 

where the convergence takes place in L 2 (n). 



N 

( 16 ) J im ~kt E Zi(^ [ai(n) M • • • • • fi(T Mn)] x) = f h d(i . 

N— >oo iV i — ' J J 



We remark that if some function a G span(J r ) satisfies |a(i) — cp(i)| <C logt for some c £ 
and p £ Z[i] with deg(p) > 2, then f)16|) fails for some system. 



2.3.2. Characteristic factors. We state a possible generalization of Theorem 12.91 

Problem 3. Suppose that ai{t) >- logi and ai{t) — aj(t) y- logi whenever i ^ j. 

Then for every system its nil] 'actor Z is characteristic for the family {[a\(n)], . . . , [a£(n)]}. 

One can easily see that the stated assumptions are also necessary. 
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2.3.3. Recurrence. The next problem provides a possible extension of Theorem | 

Problem 4. Suppose that every function a 6 span(T) satisfies \a(t) — cp{t)\ — > oo for every 
c 6 R and every p € Z[i] . 

Taen /or every system (X, X , (J,,T) and A £ X with n(A) > we have 

n(A n r' ai ( n »Un • • • n r'^^U) > o 

for some n £ N sitc/i £aa£ [aj(n)] 7^ 0. 

An interesting special case of this result is when the functions ai(t), . . . , a^{t) have different 
growth and none of them is equal to a polynomial (modulo a function that vanishes at infinity). 

If all the functions ai(t), . . . ,ae(t) are integer polynomials, then necessary and sufficient 
conditions for multiple recurrence where given in [1QJ . 

2.3.4. Combinatorics. We rephrase Problem [5] in combinatorial terminology: 

Problem 4'. Suppose that every function a G span^J 7 ) satisfies \a(t) — cp(t)\ —> 00 for every 
c 6 R and every p £ Z[i] . 

TTien every A C Z wt/l d(A) > contains patterns of the form 

(17) {m,m + [ai(ra)], . . . ,m + [a*(n)]} 

/or some m G Z and n £ N wif/i 7^ 0. 

2.3.5. Commuting transformations. It seems likely that our main results remain true when one 
works with iterates of t commuting measure preserving transformations instead of iterates a 
single transformation. We state two related problems here: 

Problem 5. Let T\, . . . ,T# be commuting invertible measure preserving transformations acting 
on a probability space (X, X, fj,) and fi, . . . , G L°°(fi). 

Then for every positive real number c the following limit exists in L 2 (n) 



1 N 

(18) lim ^h(Tt\).....f £ (Ti nC] 

N— >oo iv * — ' 



n=l 



Furthermore, if c is not an integer, then (|18|) is equal to lini/v-»oo X^n=i ^T/i • • • • " Tpfe- 



For c = 1 the existence of the limit f|18|) was established by Tao in [33] (see also [35] , [T] , |25j 
for other subsequent proofs). The case < c < 1 can be easily reduced to the case c = 1 (see 
Lemma 15.11 below) . 

Problem 6. Let T\,...,Ti be commuting measure preserving transformations acting on a 
probability space (X, X, fi). Let c\, . . . , eg E R \ Z be positive and distinct. 
Then for every fi, . . . , fi £ L°°(^) we have 



1 N 

J im Jr E /ipf* 1 *) ■■■■ hiTf%) /,•...•/, 



n=l 



where fi = E(/j|X(Tj)), and the convergence takes place in L 2 (/j,). 

This is an immediate consequence of Theorem 12.71 when all the exponents Cj are smaller than 
1 and the commutativity of the transformations Tj is not needed in this case. 
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2.3.6. Prime numbers. The results of this article should remain true when one makes the 
substitution n — > p n where p n denotes the n-th prime number. For instance: 

Problem 7. If c is a positive non-integral real number, then the sequence [p£] is good for 
multiple recurrence and convergence. Furthermore, the limit of the corresponding multiple 
ergodic averages is equal to the limit of the "Furstenberg averages" (defined by §3$)). 

One could try to verify such a statement by comparing the averages along [p£] to the averages 
along the sequence [n c ] for which all required properties are known. A similar strategy was 
used in [15] to deal with double recurrence (and convergence) problems of the shifted primes. 

Another challenge is to use the Szemeredi type results of Sections 12.11 and 12.21 and prove that 
the primes contain the corresponding Hardy-field patterns. For instance: 

Problem 8. If c,c\,C2 £ K are positive, then the prime numbers contain patterns of the form 

{m, m + [n c ], m + 2[n c ]} and {m,m + [n Cl ],m + [n C2 ]}. 

We remark that using the corresponding density results (in addition to many other things), 
Green and Tao ([21]) proved the existence of arbitrarily long arithmetic progressions in the 
primes, and Tao and Ziegler ([34]) the existence of arbitrarily long polynomial progressions in 
the primes (this last result allows one to handle Problem [8] when c,c\,C2 are positive rational 
numbers). 

2.4. Structure of the article and main ideas. In Section [3] we gather some essential back- 
ground material from ergodic theory, equidistribution results on nilmanifolds, and basic facts 
about Hardy fields. Key for our study is the structure theorem of Host and Kra (Theorem I3.3[) 
and the quantitative equidistribution result of Green and Tao (Theorem I3.2p . The use of the 
latter result is rather implicit in this article since we frequently use results from the companion 
article [H] that were proved using quantitative equidistribution. 

In Section 0] we prove Theorem 12.41 which shows that under appropriate conditions the 
nilfactor is characteristic for families of the form {[o(n)], . . . , £[a(n)]}. We remark that for 
functions of "fractional-power" growth (like a(t) = t 3//2 ), this problem can be handled using 
more or less conventional techniques. But for functions that have slowly growing derivatives 
(like a(t) = tlogt) the "standard" techniques become problematic. To overcome this problem, 
we partition the positive integers into intervals of appropriate size, and in each such interval we 
use the Taylor expansion of the function to get an approximation by real valued polynomials 
of fixed degree. This approximation works well when the function stays logarithmically away 
from polynomials, and as a result functions like t 3 / 2 , tlogt, and t + (logt) 2 become practically 
indistinguishable for our purposes. After performing these initial maneuvers we are led to 
estimating some multiple ergodic averages involving polynomial iterates (Proposition 14. ll) . a 
problem that can be handled using more or less standard techniques. 

In Section [5] we prove Theorem 12.91 which shows that under appropriate conditions the 
nilfactor is characteristic for families of the form {[oi(n)], . . . , [ai(n)]}. Since we only work 
with functions of "fractional-power" growth, we are able to adapt an argument of Bergelson 
and Haland ([7]) that was used to establish a convergence result for weakly mixing systems. The 
proof consists of two steps. One first deals with the case where all the functions have at most 
linear growth (Proposition 15.3 j) . This is done by successively applying Van der Corput's lemma 
and a change of variable trick. Then one uses a modification of the polynomial exhaustion 
technique of Bergelson to reduce the general case to the case of at most linear growth. 
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In the last section we complete the proof of the convergence and recurrence results of Sec- 
tions [2J] and [22J With the exception of Theorem 12 . 71 that can be handled directly, to prove the 
other convergence results we first make use of the results from Sections [5] and [5] to show that 
the nilfactor of the system is characteristic for the appropriate multiple ergodic averages. Then 
Theorem 13,31 enables us to reduce matters to nilsystems. Finally, we use equidistribution results 
from the companion paper [T3] to verify the appropriate convergence property for nilsystems. 
The recurrence results are direct consequences of the corresponding convergence results, with 
the exception of a special case of Theorem 12.31 where the function is logarithmically close to a 
constant multiple of an integer polynomial. In this case, a somewhat complicated analysis is 
used to prove the corresponding recurrence property for nilsystems. 

2.5. Notational conventions. The following notation will be used throughout the article: 
N = {1,2,...}, T fc = R k /Z k , Tf = f oT, e(t) = e 2wit , [t] denotes the integer part of t, 
{t} = t— [t], \\x\\ = d(t,Z), E ngj 4a(n) = pn X^neA a ( n )- We sometimes write t to represent an 
element tTL of T = M/Z. By a(t) -< b(t) we mean lim^oo a(t)/b(t) = 0, and by a(t) ~ b(t) we 
mean lim^oo a(t)/b(t) is a non-zero real number, and by a{t) <C b(t) we mean \a(t)\ < C|6(i)| 
for some absolute constant C. By we denote some half-line (c, +oo). When often write oo 
instead of +oo. By Ti we denote the class of all functions that belong to some Hardy field, by 
££ the class of all logarithmico-exponential functions, and by Q the class of functions in C(K+) 
that satisfy t k+e -4 a(t) -< t k+1 for some non-negative integer k and e > 0. 

3. Background material 

3.1. Hardy fields. We collect here some basic properties of elements of T~L relevant to our 
study. The reader can find more background material in |11| and the references therein. 

Every element of % has eventually constant sign. Therefore, if a E H, then a(t) is eventually 
monotonic (since a'(t) has eventually constant sign), and the limit lim^oo a(t) exists (possibly 
infinite). For every two functions a G H, b £ C£ (b 7^ 0), we have a/b S T~L. It follows that the 
asymptotic growth ratio lim^oo a(t)/b(t) exists (possibly infinite). 

We caution the reader that T~L is not a field, and some pairs of functions in H are not 
asymptotically comparable. This defect of T~L plays a role in some of our results, and can be 
sidestepped by restricting our attention to the Hardy field of logarithmico-exponential functions 
C£. 

A key property of elements of H with polynomial growth is that one can relate their growth 
rates with the growth rates of their derivatives: 

Lemma 3.1 (Lemma 2.1 in [2]). Suppose that a E T~L has polynomial growth. We have the 
following 

(i) Ift e -< a(t) for some e > 0, then a'{t) ~ a(t)/t. 

(ii) If t~ k -< a(t) for some k G N, and a(t) does not converge to a non-zero constant, then 
a(t)/(t(logt) 2 ) -< a'(t) < a(t)/t. 

We are going to freely use all these properties in the sequel. 

3.2. Nilmanifolds. All the equidistribution results needed in this article were established in 
|14j with the exception of one result that will be needed to cover a special case of Theorem l2.31 
Below we gather some basic facts and a quantitative equidistribution result that will be used 
in its proof. 

The proofs of all the results mentioned below can be found or deduced from [28J and [22J . 
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3.2.1. Definitions and basic properties. A nilmanifold is a homogeneous space X = G/T where 
G is a nilpotent Lie group, and T is a discrete cocompact subgroup of G. If Gk+i = {e} , 
where G^ denotes the k-th commutator subgroup of G, we say that X is a k-step nilmanifold. 
With Go we denote the connected component of the identity element in G. The representation 
of a nilmanifold X as a homogeneous space of a nilpotent Lie group G is not unique. It can 
be shown (|28J) that if X is connected, then it admits a representation of the form X = G/T 
such that Go is simply connected and G = GoI\ For connected nilmanifolds X = G/T, we will 
always assume that G satisfies these two extra assumptions. 

The group G acts on G/T by left translation where the translation by a fixed element b E G 
is given by Th(gT) = (bg)T. By mx we denote the unique probability measure on X that 
is invariant under the action of G by left translations (called the normalized Haar measure) 
and Q /T denote the Borel <r-algebra of G/T. Fixing an element b E G, we call the system 
{G/T, G/T, m, Tft) a nilsystem. We call the elements of G nilrotations. 

For every b E G the set X^ = {b n T, n E N} is a nilmanifold H/A, where H is a closed 
subgroup of G that contains 6, and A = H n T is a discrete cocompact subgroup of H. 
Furthermore, for every b E G there exists an r E N such that the nilmanifold Xi,r is connected. 

A nilrotation b E G is ergodic, or acts ergodically on X, if the sequence (6"T) ne N is dense in 
X. If b E G is ergodic, then for every x E X the sequence (6 n x) n6 N is equidistributed in X. If 
the nilmanifold X is connected and b acts ergodically on X, then for every r E N the element 
b r also acts ergodically on X. 

3.2.2. A quantitative equidistribution result. If G is a nilpotent group, then a sequence g: Z — > 
G of the form <?(n) = fc^ 1 ^ • . . . • b^. k ^ n \ where bi E G, and pj are polynomials taking integer 
values at the integers, is called a polynomial sequence in G. If the maximum degree of the 
polynomials pi is at most d we say that the degree of g(n) is at most d. A polynomial sequence 
on the nilmanifold X = G/T is a sequence of the form (g(n)T) n ^z where g: 7L — > G is a 
polynomial sequence in G. 

In [22] , Green and Tao proved a quantitative equidistribution result for polynomial sequences 
on nilmanifolds X = G/T when the group G is connected and simply connected. We will need 
an extension of this result to the non-connected case. In order to state it we first introduce 
some notions from [22] and |17| . 

If X = G/T is a connected nilmanifold, the affine torus of X is defined to be the homogeneous 
space A = G/([Gq,Gq]T). It is known ([E]) that every nilrotation acting on the affine torus 
is isomorphic to a unipotent affine transformation on some finite dimensional torua^ with the 
normalized Haar measure, and furthermore the conjugation map can be taken to be continuous. 
We can therefore identify the affine torus A of a nilmanifold X with a finite dimensional torus 

and think of a nilrotation acting on A as a unipotent affine transformation on T'. 

A quasi- character of a nilmanifold X = G/T is a function ip: G — > C that is a continuous 
homomorphism of Go (to the multiplicative group {z E C: \z\ = 1}) and satisfies tp(g'y) = ip(g) 
for every 7 E T. Every quasi-character annihilates [Go, Go], and as a result factors through 
the affine torus A of X. Under an appropriate isomorphism we have that A ~ T' and every 
quasi-character of X is mapped to a character of T { . Therefore, thinking of as a character 
of T' we have ip(t) = e(n • t) for some n E l}, where • denotes the inner product operation. We 
refer to k as the frequency of tp and ||^|| = \tz\ as the frequency magnitude of ip. 



'This means T: T l — > T l has the form T(t) = b ■ S(t) for some unipotent homomorphism S of T ; and ftsT 1 . 
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If p : Z — > M is a polynomial sequence of degree fc, then p can be uniquely expressed in the 
form p(n) = Yli=o (?) a * where ctj G M. For A?" G N we define 

(19) l|e(p(n))|| Coo[iV] = max (JV* |K||) 

where \\t\\ = d(t, Z). 

Given iV G N, a finite sequence (<?(rj)r)i< n <jv is said to be 5-equidistributed if 



1 * /" 
_^F( ff (n)r)- / Fdm x 

n=l JX 



for every Lipschitz function F: X -> C where ||^|| L ip(j^) = \\F\\oc + su Vx, y &x,x^ y d^ft^ for 
some appropriate metric dx on X. 

The next result can be found in |17j (see Theorem 3.4). 

Theorem 3.2 (Corollary of Green & Tao [22]). Zei X = G/Y be a connected nilmanifold 
(with Gq simply connected), and d € N. 

Then for every small enough 5 > there exists M = Mx,d,s £ K u>ii/i the following property: 
For every N £ N, if g: Z — » G is a polynomial sequence of degree at most d such that the finite 
sequence (g(n)T)i< n <x is not 5-equidistributed, then for some non-trivial quasi- character ip 
with HV'II < M we have 

\\1>(g(n))\\ c °°m ^ M 

where we think of ip as a character of some finite dimensional torus T ( the affine torus ) and 
g{n) as a polynomial sequence of unipotent affine transformations on T . 

Remark. We have vp{g{n)) = e(p(n)) for some p G M.[x] and therefore ||V ; (fi l ( ?7 -))llc , ° o [Af] ^ s we ^ 
defined. 

3.3. Ergodic theory. Below we gather some basic notions and facts from ergodic theory that 
we use throughout the paper. The reader can find further background material in ergodic 
theory in [19], [H], [36]. 



3.3.1. Factors. A homomorphism from a system (X,X,fj,,T) onto a system (Y,y ,v, S) is a 
measurable map ir : X' —tY', where X' is a T- invariant subset of X and Y' is an S- invariant 
subset of Y, both of full measure, such that [i o tt^ 1 = v and S o 7r(a;) = 7r o T(x) for x G X' . 
When we have such a homomorphism we say that the system (Y, y, u, S) is a factor of the 
system (X,X,fj,,T). If the factor map tt: X' — > Y' can be chosen to be injective, then we 
say that the systems (X, X , /J,,T) and (Y, y, v, S) are isomorphic (bijective maps on Lebesgue 
spaces have measurable inverses). 

A factor can be characterized (modulo isomorphism) by vr _1 (^V) which is a T-invariant sub-cr- 
algebra of B, and conversely any T-invariant sub-cr-algebra of B defines a factor. By a classical 
abuse of terminology we denote by the same letter the cr-algebra y and its inverse image by 
tt. In other words, if (Y, y, v, S) is a factor of (X, X, fj,, T), we think of y as a sub-cr-algebra of 
X. A factor can also be characterized (modulo isomorphism) by a T-invariant subalgebra T of 
L°°(X, X, n), in which case y is the sub-a-algebra generated by J 7 , or equivalently, L 2 (X, y, /x) 
is the closure of F in L 2 (X, X, (j). We will sometimes abuse notation and use the sub-cr-algebra 
y in place of the subspace L 2 (X, y, y). For example, if we write that a function is orthogonal 
to the factor y, we mean it is orthogonal to the subspace L 2 (X,y, y). 
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If J 7 is a T-invariant sub-er-algebra of X and / G L 2 (fi), we define the conditional expectation 
E(/|3^) of f with respect to y to be the orthogonal projection of / onto L 2 {y). We will 
frequently make use of the identities 

J E(/|3>) dfi=ff dfi, TE(f\y) = E(Tf\y). 

If we want to indicate the dependence on the reference measure, we write E = E^. 

The transformation T is ergodic if Tf = f implies that / = c (a.e.) for some c G C. Every 
system (X, X,fj,,T) has an ergodic decomposition, meaning that we can write jx = J m d\(t), 
where A is a probability measure on [0, 1] and \xt are T-invariant probability measures on (X, X) 
such that the systems (X, X , fJ,t,T) are ergodic for t G [0, 1]. 

We say that (X,X,/jl,T) is an inverse limit of a sequence of factors (X, Xj, /J,,T) if (Xj)j £ ^ 
is an increasing sequence of T-invariant sub-a-algebras such that VjeN = ^ up to sets of 
measure zero. 

3.3.2. Seminorms and nil] 'actors. Following [26J0 for every system (X, X , fj,,T) and function 
/ G L°°(/i), we define inductively the seminorms \\f\\e as follows: For £ = 1 we set 

ll/li = ||E(/|X)|| L2(/l) 

where I is the cr-algebra of T-invariant sets. For £ > 1 we set 

(20) l/l&T = lim Ei< n <jv|||/- T n /|||f 

N— >oo 

It was shown in [26] that for every integer £ > 1, ||| • |||^ is a seminorm on L°°(yu) and it 
defines factors Zi-\ = -2^_x(T) in the following manner: the T-invariant sub-a-algebra 2^_x is 
characterized by 

for / G T°°(/i), E(/|Z*_i) = if and only if |||/||| £ = 0. 

It is shown in [26j that for every £ G N the factor Zi has a purely algebraic structure, in fact 
for all practical purposes we can assume that it is an ^-step nilsystem. 

Theorem 3.3 (Host & Kra [26]). Let (X,X,fi,T) be a system and £gN. 

Then a.e. ergodic component of the factor Z^{T) is an inverse limit of £- step nilsystems. 

Because of this result we call Z(_ the l-step nilfactor of the system. The smallest factor that 
is an extension of all finite step nilfactors is denoted by Z and is called the nilfactor of the 
system (in other words Z = VjeN-^r) ^ ne nilfactor Z is of particular interest because, as it 
turns out, it controls the limiting behavior in L 2 ([i) of the multiple ergodic averages that are 
studied in Theorems 12.11 and 12.61 

We also record two useful identities that can be easily established using the induction defi- 
nition of the seminorms 

(21) l/l? = J \\f id d\(s), = |/®7|„x^. 

where fj, = J fi s d\(s) is the ergodic decomposition associated to the system (X, X, fi, T). Hence, 
if Tt where t G [0, 1] are the ergodic components of the transformation T, then E{f\Z^{T)) = 

In [26] the authors work with ergodic systems and real valued functions, but the whole discussion can be 
carried out for non-ergodic systems as well and complex valued functions without extra difficulties. 

^We remark that the limit is the same if the average Ei< n <jv is replaced with the average where 
($jv)jvsn is any F0lner sequence in Z. 
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if and only if R(f\Z t (T t )) = for a.e. t G [0,1]. Also if / satisfies E M (/|^(T)) = 0, then 
E /iX/1 (/8)7|^_ 1 (rxr)) = 0. 



4. Characteristic factors for multiples of a single sequence 

A crucial step in the proof of Theorem 12, II is to show that for every a G T-L, not growing very 
fast or very slow, for every I G N, the nilfactor Z of a system is characteristic for the family 
{[a(n)], 2[a(n)], . . . ,-£[a(n)]}. This is the context of Theorem 12,41 which we are going to prove 
in this section. 

As is typically the case with such results, one assigns a notion of "complexity" to the relevant 
multiple ergodic averages, and then uses induction on the "complexity" to prove the result. 
This plan can be carried out without serious difficulties when the function a G % satisfies 
** _1 logi -< a{t) ■< t k for some k G N. But when a(t) = tlogt, for example, there are serious 
difficulties caused by the fact that the factor Z is not characteristic for the Hardy sequence 
[a'(n)j, the reason being that the sequence [logn] grows too slowly. To deal with such functions 
we perform some initial maneuvers that enable us to transform the problem to one where 
induction on the "complexity" is applicable. Before giving the formal argument we informally 
explain how the initial step of the proof works in a model case. 

4.1. A model problem. Suppose we want to show that the nilfactor Z is characteristic for 
the family of sequences {[nlogn], 2[nlogn], . . . ,£[nlogn]}. Let (X,X, fi,T) be a system, and 
suppose that one of the functions fi, ■ ■ ■ , fi G L°°(fj,) is orthogonal to Z. We have to show that 

(22) lim E 1<n<iV V([nlogn]) =0, 
where 

V(n) = T n f 1 -T 2n f 2 -...-T in f e , 

and the convergence takes place in L 2 (u). Our goal here is to show how to transform (I22p into 
a more manageable identity. 

It will be more convenient for us to show that 

(23) lim E N<n<N+ i( N )V([n logn]) = 

for some function l(t) that satisfies l(t) -< t (Lemma 14.31 shows that (|23p implies (|22p ). Using 
the Taylor expansion of a(t) = tlogt around the point t = N we get for every n G N that 

2 S 
Tl ft 

(N + n) log(N + n) = NlogN + nil + log N) + — - 

2N 6& 

for some £ n G [N, N + n]. Hence, if c < 2/3, then for every large iV and n = 1, . . . , [iV c ], we 
have 

n 2 ' 

[(N + n) \og{N + n)] = \a N + n(3 N + —J + e(n) 

for some ctN,PN G M. and error terms e(n) G {0,-1}. Ignoring the error terms, and writing 
[V2N] 2 in place of 2A^ (all these technical issues can be justified), we get that in order to 
establish (|23l) it suffices to show that 

2 

(24) lim E 1<n<N cV ( a N + n(3 N + Z- ) = 0. 




Since every integer between 1 and N c can be represented as n[-\/2N] + r with < n < l(N) 
N C /[V2N] and 1 < r < r(N) = [y/2N], and since 



1 1 r i=L-- - + {n[V2N} + r)a + /3 = n 2 + na r>N + {3 r , N 
[\'2N\ 2 

for some a r ^N,Pr,N E Rj we get that (|24p follows if we show that 





We have therefore reduced matters to establishing uniform estimates for some polynomial 
multiple ergodic averages, and this turns out to be a more manageable problem. 

We also remark that the argument used in the previous model example turns out to work for 
every function a E T~L of polynomial growth that satisfies \a(t) — p(t)\ >- logi for every p E R[t]. 
We give the details in the next subsection. 

4.2. Proof of Theorem 12.41 modulo a polynomial ergodic theorem. We are going to 
prove Theorem 12.41 modulo the following polynomial ergodic theorem that we shall prove in the 
next subsection: 

Proposition 4.1. Let (X, X , fi,T) be a system, and suppose that at least one of the functions 
fl, f2, ■ ■ ■ , fi € L°°{(j) is orthogonal to the nilf actor Z. 

Then for every k £ N, nonzero a £ R, bounded two parameter sequence (c7v,n)Af,neN of real 
numbers, and F0lner sequence ($7v)_/veN in 2 we have 



The main step in the deduction of Theorem 12.41 from Proposition 14.11 is carried out in 
Lemma 14.41 below. Before delving into the proof of this lemma we mention two useful in- 
gredients that will be used in its proof. The first one was proved in [14J: 

Lemma 4.2. Let a £ H have polynomial growth and satisfy \a(t) — p(t)\ y logi for every 
p € R[t}. 

Then for some k E N we have 

(26) |a (fc+1) (t)| decreases to 0, l/t k ~< a (k) {t) ~< 1, and (a (fc+1) (t)) fc -< {a (k \t)) k+1 . 

The second is the following simple result: 

Lemma 4.3. Let (V(n)) ne jsj be a bounded sequence of vectors on a normed space. Suppose that 

lim (E N<n<N+l{N) V(n)) = 

for some positive function l(t) with l(t) -< t. Then 



lim sup E ne$JvCiV ,n T^ a+P ^ fi ■ T 2 ^ nka+ P^ f 2 ■ . . . ■ T l ^ ka+ ^ f t 



L 2 M 



= 0. 



lim Ei< n <7vV(n) = 0. 
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Proof. We can cover the interval [1, N] by a union of non-overlapping intervals of the form 
[k, k + l(k)], we denote this union by In- Since l(t) ~< t and the sequence (V(n)) n& ^ is bounded 
we have that 

lim Ei< n <jv"^(n) = lim E ne i N V(n). 

N—too TV— !>oo 

Using our assumption, one easily gets that the limit limjv->oo ^n^i N V(n) is 0, finishing the 
proof. □ 

Lemma 4.4. Let (V(n)) n ^ be a bounded sequence of vectors on a normed space. Suppose that 
for every k G N and bounded two parameter sequence (cN, n )N,neN of real numbers we have 



lim sup 

N ->°° P m h -i[t], 



E l < n < N c N , n V(n k + [p(n)}) 



0. 



Then if a £ H has polynomial growth and satisfies \a(t) — p(t)\ y logi for every p G 
and (c n )n£'M is any bounded sequence of real numbers, we have 



lim E 1<n<N c n V([a(n)]) = 0. 

TV— >00 ~ ~ 



Proof. For convenience we assume that c n = 1 for every n G N, the proof is similar in the 
general case. By Lemma 14.31 it suffices to show that 



(27) 



lim E N<n<N+l f N )V([a(n)]) = 



for some function l(t) with l(t) -< t (we shall impose more conditions on l(t) as the argument 
proceeds). 

Let fcsNbe such that the conclusion of Lemma 14.21 is satisfied, namely, 

(28) |a (fc+1) (t) | decreases to 0, l/t k ~< a^(t) ~< 1, and {a^ k+l \t)) k -< (a (fc) (t)) fc+1 . 

For convenience we are going to assume that a^ k \t) is eventually positive, the proof is similar 
in the other case. 

Using the Taylor expansion of a(t) around the point t = N we get for every n G N that 



(29) 



a(N + n) = a(N) + na'(N) + ■■■ + ^-a (fc) (iV) + 

kl (k + 1)\ 



^fc+i 



for some £ n G [N, N + n\. Since |a ( - fc+1 ^(t)| is eventually decreasing, for every large N we have 
|a (fc+1) (£n)| < |a( fc+1 )(iV)|. It follows that if l(t) satisfies 

(30) {l{t)) k+1 a {k+1) (t) ~< 1, 

then for every large N and n = 1, . . . , [l(N)] we have 

(31) 



[a(N + n)]= a(N) + na'(N) + --- + —a (k >(N) +e N (n) 
l k\ J 

where the error terms ejv(^) take values in the set {0, —1} (we used that a^ k+1 \t) is eventually 
negative). For t G [0, 1] and x positive we have 



{x + t) k x k 



< 



rk+l ' 



therefore if 



d{t) 



k\ 



aW(t) 
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then setting x = [yJd{N)\ and t = { tyd(N)} in the previous estimate we get 



a«(iV) 



1 



< 



fc+1 



M [^/d(N)} k 
From this estimate and (I31D . it follows that if 

(32) (i(t))V fc) (*)) 1+i -:i, 

then for every large iV and n = 1, . . . , [£(iV)] we have 
[a(N + n)} = \a(N) + na'(N) + 



(aW(JV)) 1+ i 



e N {n) 



where the error terms e/v(n) take values in the set {—2, —1,0, 1}. Hence, in order to establish 
(p7|) it suffices to show that 



(33) 



lim sup 



E l<n</(Af)^ 



n 



+ eAr(n 



0. 



Next notice that ([33]) follows if we show that for every bounded sequence {cN,n)N,n^n we 
have 



(34) 



lim sup 

tf-+°°pe» fc _i[t] 



E l<n<«(Af)CAT,n ^ 



L[V^(A0] fc 



+ p(n) 



0. 



Indeed, it suffices to use (f34"|) when CAr )ri = gjv(fc)=i}( n ) f° r * = ~~ 2,-1,0, 1, and then add 
the corresponding identities. 

We perform one last maneuver by rewriting (|34p in a more convenient form. Notice that 
every integer between 1 and l(N) can be represented as [ tyd(N)]n + r with < n < l(N) = 
l(N)/[fyd(N)] and 1 < r < [tyd(N)]. Furthermore, if we choose l(t) so that 

(35) (l(t)) k a^(t)yl, 

then we have l(N) — > oo. Since for every bounded sequence of vectors V(n) the average 
^i<n<b(N)V(n) is equal to E 1 < r < r ( N ~ ) E 1 < n ^ N - ) V([y / d(N)]n + r) (up to negligible terms), an 
easy computation (similar to the one used in the model example) shows that in order to establish 
(|34p it suffices to show that 



(36) 



lim sup 



Ei< n < N c Ntn V([n k +p(n)j) 



0. 



Summarizing, we have reduced matters to establishing (|36|) . which holds by our assumption, 
provided that there exists a function l(t) that satisfies all the conditions imposed previously, 
namely, lit) ~< t and the conditions stated in equations ([30]) . ([32]) . and ([35]) . Equivalently, the 
function l(t) must satisfy 

1 11 

— -< l{t), lit) -< t, l(t) ~< - 1-, and l(t) -< 

hi 



i( fc )(f))fe 



(a( fc )(t))^ + ^ 



(a(fc+i)(t))Wi 

That such a function l(t) exists follows from the second and third conditions in (|28p . thus 
completing the proof. □ 

With the help of Proposition 14.11 and Lemma 14.41 it is now easy to prove Theorem 12. 4[ 
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Proof of Theorem \2.4\ Let (X, X,fi,T) be a system and suppose that at least one of the func- 
tions fx, . . . ,fi G L°°(fi) is orthogonal to the nilfactor Z. Let a G rl have polynomial growth 
and satisfy a(t) y logi. We have to show that 

(37) lim E 1<n<N T^ h • f t = 

where the convergence takes place in L 2 (fi). 

Combining Proposition 14.11 and Lemma 14.41 we immediately get that (|37p holds if a G % has 
polynomial growth and satisfies \a(t) — p(t)\ >- logi for every p G R[i]. Therefore, it remains to 
deal with the case where a(t) = p(t) + e(t), where p G R[i] is non-constant and e(t) <C logt. 

Suppose first that e(t) is bounded. Then e(n) — > c for some cgK, and as a result for every 
large n G N we have [a(n)] = [p(n) + c] + e(n) for some sequence (e(n)) ne nj with e(n) G {0, ±1}. 
Using this, we deduce that ([37]) follows from Proposition 14.11 

The last case to consider is when 1 -< e(t) -< logt. Let I m = {n G N: [e(n)] = m}. Since 
e(n + 1) — e(n) —> (this follows from e'(t) —> and the mean value theorem), and e(n) — > oo, 
it follows that for every large m the set I m is an integer interval with length that increases 
to infinity. Notice also that for n G I m we have [a(n)] = [p(n)] + m + e(n) for some sequence 
(e(n)) n£ N with e(n) G {0, ±1}. Using this, we deduce that (|37|l follows from Proposition 14.11 
This completes the proof. □ 

4.3. Proof of the polynomial ergodic theorem. Let V = {pi, . . . ,pi} be a family of poly- 
nomials with real coefficients. We say that the family V consists of non-constant and essentially 
distinct polynomials, if all the polynomials and their pairwise differences have positive degree. 
The maximum degree of the polynomials is called the degree of the polynomial family, and is 
denoted by deg('P). Given a polynomial family V, let Vi be the subfamily of polynomials of 
degree i in V . We let Wi denote the number of distinct leading coefficients that appear in the 
family Vi. The vector (d,Wd, ■ ■ ■ ,w\), where d = deg('P), is called the type of the polynomial 
family V. 

We will use an induction scheme, often called PET induction (Polynomial Exhaustion Tech- 
nique), on types of polynomial families that was introduced by Bergelson in [3]. We order 
the set of all possible types lexicographically, meaning, (d, Wd, ... ,u>i) > (d' , w' d , . . . , w'-±) if and 
only if in the first instance where the two vectors disagree the coordinate of the first vector is 
greater than the coordinate of the second vector. 

Given a family of non-constant essentially distinct polynomials V = {pi, . . .pe}, a positive 
integer h, and p G V, we form a new family V(p,h) as follows: We start with the family of 
polynomials 

{pi(t + h)- p(t), . . . , Pi (t + h)- p(t),px(t) - p(t), . . . ,p t (t) - p(t)}, 

and successively remove the smallest number of polynomials so that the resulting family consists 
of non-constant, essentially distinct polynomials. Then for every large h the function pi{t + h) — 
p(t) will be removed if and only if pi is linear (then {pi{t + h) — p(t)) — (pi(t) — p(t)) = pi(h)), 
and the function Pi(t) — p(t) will be removed if and only if p = p^. 

Example 1. If V = {t, 2t, t 2 } and p(t) = t, then we start with the family of polynomials 

{h, t + 2h, (t + h) 2 - t, 0, t, t 2 - t] 
and remove the first, second, and fourth polynomials to get 

V(t, h) = {(t + h) 2 - t, t, t 2 - t}. 
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Notice that the family V has type (2,1,2), and the family V{t,h) has smaller type, namely, 
(2,1,1). 

The main step in the proof of Proposition 14.11 is carried out in Lemma 14.71 This lemma is 
proved using induction on the type of the family of functions involved. In order to carry out 
the inductive step we will use the following: 

Lemma 4.5. Let V = {pi, . . .pi} be family of non-constant essentially distinct polynomials, 
and suppose that deg(p%) = deg(V) > 2. 

Then there exists p 6 V such that for every large h the family V(p, h) has type smaller than 
that ofV, and deg(p\(t + h) — p(t)) = deg(V(p,h)). 

Remark. Since deg(pi) > 2, no-matter what the choice of p will be, the polynomial pi(t + h) — 
p(t) is going to be an element of the family V(p, h) for every large h. 

Proof Suppose first that deg(pj) < deg(pi) for some i G {2, . . . ,£}. Let iq be such that the 
polynomial pi has minimal degree. Then p = pi has the advertised property. 

Otherwise, all the polynomials have the same degree, in which case for i = 2, ... ,£ we have 
Pi(t) = aipi(t) + qi(t) for some non-zero real numbers 02,..., ot-i and polynomials % with 
deg(gj) < deg(pi). If a.i 7^ 1 for some io £ {2, . . . ,£}, then p = p, has the advertised property. 
If cti = 1 for i = 2, . . . , £, let in be such that the function qi Q has maximal degree. Then p = pi Q 
has the advertised property. This completes the proof. □ 

We are also going to use a variation of the classical elementary lemma of van der Corput. 
Its proof is a straightforward modification of the one given in |3j, therefore we omit it. 

Lemma 4.6. Let {vn n }jv n eN be a bounded sequence of vectors in a Hilbert space, and (<3?7v)7VeN 
be a F0lner sequence of subsets o/N. For every h E N we set 



b h = linijv^oo 

Then 



^n£<S> N < VN,n+h, VN,n > 



hmTV^oo ||E ne $ 

To state our next result it will be convenient to introduce some notation. For N € N let 
'Pn = {Pi,N 1 ■ ■ ■ , P£,n} be a family of polynomials with real coefficients. We say that the 
collection (Vn)n£N is "nice" if for every N £ N the polynomials p^jv and Pi,N~Pj,N (for i / j) 
are non-constant and their leading coefficients are independent of N. 

Lemma 4.7. Let ({pi,n , ■ ■ ■ ,Pi,n})ngn be a "nice" collection of polynomial families. Let 
(X, X, [a, T) be a system, and suppose that one of the functions fi, ■ ■ ■ , fe G L°°(p) is orthogonal 
to the nilf actor Z. 

Then for every F0lner sequence (<]?7v)7VeN in 2 and bounded sequence (cN, n )N,nei>i we have 
(38) lim E ne * N c N , n T^Mn)] fl ..... T \PtM")]f t = 

N— >co 

where the convergence takes place in L 2 (pi). 

Remark. In the special case where Pi jsj = p% for i = 1, . . . ,£ we get a different prooH of the 
known result that the nilfactor Z is characteristic for any family of non-constant, essentially 
distinct polynomials of a single variable. 



^In contrast with the proof given in [55] , we do not have to work with polynomials of several variables (which 
was a key trick in [29]) in order to prove the single variable result. 
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Proof. Without loss of generality we can assume that the function f\ is orthogonal to the 
nilfactor Z. Furthermore, we can assume that H/iH^ < 1 for i = 1, . . . ,£, and |cAr jn | < 1 for 
every N, n £ N. It will be crucial for our argument to assume that the polynomial pi n has 
maximal degree within the family Vn = {pi,N, ■ ■ ■ >PI,n}- To get this extra assumption it is 
convenient to somewhat modify our goal; instead of (j38|) we shall prove that 



(39) lim 



sup 



E 



n£<I>jv 



0. 



N ^°° 11/olUIIMU- ■■.ll/ill»<i 
Notice first that ([39]) implies (13"5j) . Indeed, fl39]) gives that 

(40) lim E ne$JvCiV ,n / /o,7V • T^^/i • TfcM")]f 2 ..... T fc,*(»)]£ d/i = 

N—>oo J 

whenever /o 5 at G L°°(fi) satisfies ll/o.ivll.— < 1 for A^ G N. Using PU|) with the conjugate of the 
function E ne $ N c N , n T^M^fr ■ T^Mn)]f 2 ..... Tb*M»)]f t in place of the function / 0>JV (for 
every iV G N), we get (l38j) . 

Next we claim that when proving (|39h we can assume that the polynomial pi t N has maximal 
degree within the family Vn- Indeed, if this is not the case, then degfj^jv) < degfp^jv) for 
some i = 2, . . . , £, say this happens for i = £. After factoring out the transformation rb«,Jv(w)] 
we see that (1391) can be rewritten as 



(41) 



lim 



I /oil 



sup 

IIA|| ao ,.».[|/< 



E 



<i 



ne^Ar 



j^[P2,N{n)-pe,N(n)]+e 2iN {n) . . 2 n [P<?-i,Jv( n )-P£',iv( n )]+ e £-i,]v(™) ^„ 







for some error terms e$ jv(n) with values in the set {0, 1}. Furthermore, notice that (|4ip follows 
if we show that 



(42) lim sup Enefcjv 

iV ^ 00 ll/olU,ll/ 2 IU,..,ll/dL<i 



J<]P2,N( n )-Pt,N(n)] f 2 . ... . TlPe-i,N(n)-p e , N (n)] ^ 

Since the collection of polynomial families (V' n )n£N, where 

'P'n = {-Pt,N,Pl,N ~ Pe,N, ■ ■ ■ ,Pi-l,N ~ P£,n}' 

is also "nice", and the polynomial pi t N — Pe,N has maximal degree within the family V' N , the 
claim follows. 

Summarizing, we have reduced matters to establishing ([39]) for every system, assuming that 
the function f\ £ L°°(^) is orthogonal to the nilfactor Z and the polynomial P\,n has maximal 
degree within the family Vn- We shall do this by using induction on the type of the family of 
polynomials Vn (the type of this family is independent of N). 

The case where all the polynomials have degree 1 can be treated as in the proof of (I52p in 
Proposition 15.31 below. (For linear functions, the same argument works for any F0mer sequence 
^at in place of the intervals [1, A 7 "]; we leave the routine details to the reader.) 

Now let d > 2 and suppose that the statement holds for every "nice" collection of polynomial 
families with type smaller than (d,Wd, ■ ■ ■ ,wi)- Let (Vn)n£N> where Vn = {pi,N, ■ ■ ■ ,P£,n}, 
be a "nice" collection of polynomial families with type (d, wa, ■ ■ ■ , Wi)- 
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Using the Cauchy-Schwarz inequality we see that (|39p follows if we show that 



lim 



sup Ene$jv 

/olloojll/allooj—ill/clloo^l 



0. 



This last identity can be rewritten as 



lim 

AM-oo I 



iol 



sup Ene^jv 

IMIoo.-.ll/<lloo<l 



F ■ S [p ^ N{n)] F l ■ S [p2 ' N{n)] F 2 • . . . • S [pe < N{n)] F e d{n xfi) = 



where S = T x T and Fj = /j <g) /j for i = 0, 1, . . . , I. Using the Cauchy-Schwarz inequality we 
see that it suffices to show that 



(43) lim sup 

N -+°° ll*bllao.-.ll*illoo<i 



E 



. g\pi,N(n)]p . g]p2,N(n)] p 2 . . g\pi,N(n)]p 



L 2 (/iX/x) 







where F\ = f\ (g) /j. We choose functions i^jv, i = 2, . with sup norm at most 1, so that 
the value of the norms in (|43p is 1/N close to the supremum and use Lemma 14.61 We get that 
I3"j) follows if we show that for every large h we have 



lim 



sup 



1*2 II 



II^IU<i 



E ne $ N J S^ Pl - N( - n+k ^Fi • 5 , [P2,iv(ri+/i)] . . 5-bf : iv(n+^)]^. 

S-bl.ivCn)]^ . S [ P 2,N(n)]p 2 . _ _ _ . 5 ^,iv(n)]^ ^ x ;i ) 



Factoring out the transformation ,f?biv( n )] ; w here pjv = Pi iV f° r some j E {1, ...,£} is chosen as 
in Lemma 14.51 (the choice of i is independent of N), we can rewrite the last identity as 



lim 

jV->oo i 



sup 



E, 



F 2 || 



.11^11 <* 



gbi.iV^+^-PivMI+e^AKM)^. . ( 5'b€,Jv("+^)-PJv(n)]+e< 1 jv(ft,n)^i 



< 5'[Pi,iv( rl )-PivW]+ef+i,jv(^,")j7 1 1 . . Cj[pt,N(n)-PN(n)]+e 2 i, N (h,n) x ^\ 







for some error terms e%ji(h, n) with values in {0, 1}. For every fixed h, N G N we can partition 
the integers into a finite number of sets, that depend only on £, where all sequences ei^{h,n) 
are constant. Therefore, the last identity follows if we show that for every large h we have 



(44) lim 



7V-s>oo 



sup Ene*Ar 
^lU,..,!!^!! <i 



Cj{pi,N(n+h)-p N (n)] p ± . . g[Pl,N(n+h)-p N (n)\ p 



g\pi,N(n)-pN(n)] p ± . . g]pe,N(n)-PN{n)] pg ^/^ x ^\ 



0. 



Next notice that if the polynomial p^jv have degree 1 (this can only happen for i ^ 1), then 
Pi,N{n + h) = Pi,N(n) + cjy(h) for some cj^ih) £ R. Hence, for those values of z we can write 

gbi,iv(™+' l )-Piv(n)]^. . ( g'[Pi,iv( n )-P]v(n)]^ i _ ^[Pi,ivW-PivW]^[ c Jv('>)]+ e Jv('»,n)j?. . p~A 

for some error terms e^f(h,n) £ {0,1}. As explained before, because the error terms ejsr(h,n) 
take values in a finite set, when proving ()44p we can assume that ejs[(h,n) = 0. Therefore, we 
have reduced matters to showing that for every large h we have 



(45) lim sup 

Ar - +0O ||F ||,||F 2 || oo ,...,||F r || oo <l 



E ne$JV J F ■ S^'^+V-P^Fx ■ S^ N ^F 2 • . . . • 

S [^Mn)]p r d ^ x M ) 
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for some r 6 N, where all the polynomials involved are elements of the collection of polynomial 
families iJ^NiPN-ih))^^ (defined on the beginning of this subsection). For every large h this 
new collection of polynomial families is "nice" and by Lemma [4.51 it has type smaller than that 
of (Pn) N&i- Furthermore, the degree of the polynomial pi,Ar(n + h) —pN{n) is maximal within 
the family V^iPNj h). Since fx is orthogonal to the factor Z(T), we have that F\ = fx <g> f 1 
is orthogonal to the factor Z{S). Therefore, the induction hypothesis applies, and verifies (145p 
for every large h. This completes the induction and the proof. □ 

We are now ready to give the proof of Proposition 14.11 



Proof of Proposition \4-l\ Let (X, T) be a system, and suppose that at least one of the 
functions fx, ■ ■ ■ , fi G is orthogonal to the nilfactor Z. Our goal is to show that for 

every bounded two variable sequence (cjv,n)neN we have 



(46) lim sup 



E n ^ N c Nn T^^WJ/i • T 2 ^ nka+P ^f 2 • . . . • T^+P^ft 



= 0. 



We choose polynomials pn G Rfc_i[t], so that the norm in (|46p is 1/N close to the supremum. 
Then (|46p takes the form 



(47) lim 

Af->oo 



= 0. 



Equation (|4T|) can be rewritten as 

(48) lim E„ g $ CN n T^ nka+PN ^ fi-T^ nka+PN ^ n ^ +e2 - N ^ f 2 -. . .■T^ nka+PN ^ n ' ) ^ +ee ' N ^ fi = 

where convergence takes place in L 2 {^i) and the error terms e2,jv(^), • • • , £i.,N{ n ) take values in 
the set {0, —1, . . . , — £}. Using Lemma [4. 71 we deduce that (|48p holds, completing the proof. □ 

5. Characteristic factors for several sequences 

A crucial step in the proof of Theorem l2.6l is to show that the nilfactor Z is characteristic for 
the related multiple ergodic averages. This is the context of Theorem 12.91 which we are going 
to prove in this section. Our proof extends an argument used in [7] where a similar result was 
verified for weakly mixing systems. Since there are a few non-trivial extra complications in our 
case, we shall give our proof in more or less full detail, referring the reader to [7j only when an 
argument we need is a straightforward modification of one used there. 

5.1. The sub-linear case. First, we are going to prove Theorem 12.91 in the case where all the 
functions have at most linear growth. 

We first give two lemmas that will be used in our proof. The first is implicit in [7]: 

Lemma 5.1. Let (VA r (^))Ar,neN be a bounded two parameter sequence of vectors in a normed 
space and suppose that a G H satisfies t e -< a(t) -< t for some e > 0. 
Then 

lim (E 1<n<N V N ([a(n)]) - E 1<n<N V N (n) ) =0. 

Proof. Without loss of generality we can assume that the limits of both averages exist. 

Let win) = \m G N: = n\ and Win) = Ylm=i w ( m )- Using our hypothesis on a(t) it 

is not hard to verify (for the details see Lemma 2.5 in [7]) that 

/ . x win) n , nwin) . , , . 

49 — -> and — j ' is bounded. 

Win) Win) 
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Using the definition of w{n) and W(n), that w(n)/W(n) — > 0, and that (V)v(n))jv,neN is 
bounded, we conclude that 



lim E 1<n<N V N {[a(n)}) = lim — — — Vic(n)Vjy( 

TV— >oo ~ ~ TV— >oo W[I\) z — ' 

v ' n=l 



n). 



Let A(iV) = E„ =1 Vjv(n) and A = lim/v-s^oo A(N). Using summation by parts we have 

1 N 1 N 

w(n)V N {n) = (Nw{N)A{N) + ^ n(w(n - 1) - w(n))A(n)) . 



n=2 



Letting V/v( n ) = 1 in the previous identity we find that 

N 



1 = w(n) { Nw W + E n{ ~ w{ri - ^ - w ^ 



n=2 



It follows that 

N 



WTN)^ w{n)VN(n) ~ A 

n=l 



N 



W(N) 



< 



Nw(N)(A(N) - A) + ^2 n{w{n - 1) - w(n))(A(n) - A) 

n=2 

NW{N) \(A(N) - A)\ + W L^\ Y,<w{n - 1) - w(n))(A(n) - A) 



W(N) 



n=2 



h(N) 



h(N) 



Since Nw(N)/W(N) is bounded and A(N) — > A we have that Ii(N) — > 0. Furthermore, using 
that A(N) — > A, and W(N) — > oo, we see that for given e > we have 



N Y,n=2 \ w ( n - !) - w ( n ) 



I 2 (N) < 



e + ojv(l) 



N(w(N) -w(l)) 



e + o N (l) 



W(N) JVW W(N) 

where the last equality holds because w(n) is increasing. By (jH|), the last fraction is bounded 



by Ce for some positive C G R. This proves that 

N 



lim 

N— >oo 



l^ w (n)V(n)-A 

n=l 



< Ce. 



Since the constant C depends only on the function a(t), and e is arbitrary, the proof is complete. 

□ 

The second lemma is a well known estimate, we prove it for completeness. 

Lemma 5.2. Let (X, X,/i,T) be a system and f\ G L°°(/x). 
Then we have 



(50) 



lim sup sup Ei< ra <Ar 
N ~>°° ll/olloo<i 



J f ■ T n fi dn 



< 



'12- 
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Proof. Using the Cauchy-Schwarz inequality we see that 



E 



Kn<N 



fo ■ T n h dfi 



< E 



Kn<N 



J fo ■ T n 



fi dji 



The last average can be rewritten as 



E 



Kn<N 



J F ■ S n F 1 d(n x M ) 



where S = T x T ', Fo = fo <g> f , and Fx = fx ® f 1 . Assuming that H/olloo < 1, and using the 
Cauchy-Schwarz inequality again, we find that the last average is bounded by 

||Ei< n <Af5' n i ? i|| i2(AtX/j) . 

Taking limits, and using the ergodic theorem, we conclude that the square of the limit in (I50p 
is bounded by 

||W/l^7l|X(S))|| L2( ^ r 

where the last identity follows from the second identity in (|21j) and the definition of ||| / 1|| i . This 
establishes the advertised estimate. □ 



fill 



In the proof of Proposition ^. 31 we are going to use the symbol < Wl Wjl when some expression 

is majorized by some other expression and the implied constant depends on the parameters 
wi, ...,w k . 

Proposition 5.3. Suppose that ax, ■ ■ ■ , a,£ E C£ satisfy t e -< <Xi(t) <C t for some e > 0, and the 
same is true for the functions a«(t) — a,j(t) for i 7^ j. 

Then the factor Z is characteristic for the scheme {[ai(n)], . . . , [a^(ra)]}. 

Proof. Let us first remark that in order to carry out our argument it will be convenient to work 
with a Hardy field H that contains ££ and such that the set H + = {a G H : a{t) —> 00} is 
closed under composition and compositional inversion (££ does not have this last property). 
Such a Hardy field exists, in fact as mentioned in [11], a constructive example is the field of 
germs at 00 of Pfaffian functions, which we denote by (We are not going to make use of 
the exact structure of V.) Our goal is to show that if the functions a%, . . . , a£ S V satisfy the 
stated assumptions, and one of the functions fi, ■ ■ ■ , ft is orthogonal to the nilfactor Z, then 



(51) 



lim 



Ei<„,<ArT [ai(n)1 /i • T [a2(n)] / 2 • . . . • T Mn)] ft 



LHp) 



0. 



Without loss of generality we can assume that the function fx is orthogonal to Z. 

We are going to use induction on I to show the following: If (X, X, fj,, T) is a system, fx 
satisfies H/iH^ < 1, and the functions ax, ■ ■ ■ ,at G V satisfy the stated assumptions, then 
(52) 

sup Ex< n <N I fo-Tl^fx-T^^h-.-.-T^fedf, 

3 ,H/2lL,..,||/*lL<i " J 

We also claim an analogous estimate with /j in place of fx for i = 0,2, . . . ,£. We leave it 
to the reader to verify that such estimates follow from (I52p (for i = 2, . . . , £ by symmetry, for 
i = we factor out the transformation T^"^ and work with the resulting averages, the precise 
argument is very similar to the one given in the beginning of the proof of Lemma 14.71) . 



lim sup 

11/olL 



10 This is defined inductively as follows: We let Vi be the set of all / G C°°(R + ) that satisfy /' = p(t,f) for 
some p £ Z[to,ti], and for k > 2 we let Vk be the set of all / € C°°(R+) that satisfy /' = p(t,fi, . . . , fk-i, /) 



for some p € Z[to,ti, 



,tk] and fi £ Pi. Then V is the set of germs of functions in Ufcen^fc- 
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First we verify that (j52[) implies (|5ip . We can assume that 1 1 1 1 < 1 for % = 1, ... ,£. Since 
fi is orthogonal to Z, we have |||/i|||2^ = 0, and as a consequence the limsup in (|52|) is 0. Using 
(1521) with the conjugate of the function Ei< n <iyrT[ 0l ( n )]/i • T^™)] f 2 • . . . • T^ n ^f t in place of 
the function /o (for every JVgN), and removing the norms we get (|5ip . 

We proceed now to prove ([52]) by induction. Suppose first that t = 1. If ai(i) -< i, then we 
deduce from Lemma 15. II that 



(53) lim 



sup Ei< n <jv 

Il/Olloo^ 1 



SUp Ei< n <Ar 

ll/oll <l 



//o- 



0. 



Equation (I52p now follows by combining (j53H and the estimate in Lemma 15.21 If a\(t) ~ t, 
then ai(t) = at + e(t) for some nonzero a £ M and e{t) -< t. Assuming that a > (the other 
case can be treated similarly), one sees that limit in (152p is bounded by a constant (one can 
use [a] + 1 if a > 1 and 2 if a < 1) times the quantity 



limsup sup Ei< n <7v 



f f • T n h dp 



Combining this with Lemma 15.21 gives the advertised estimate. 

Suppose now that I > 2 and the statement holds for We first claim that when proving 

(|52p we can assume that the function a\(t) has maximal growth. Indeed, if ai(t) -< a,i(t) 
for some i G 2, ... ,£, then we can factor out the transformation T^™^ and work with the 
resulting average. We omit the details since the argument is very similar to the one given in 
the beginning of the proof of Lemma 14.71 

We consider two cases: 

Case 1. Suppose that ai(t) ~ t. We can assume that for some r £ {1, ... ,£} we have 
ai(t) = a>\t + bi(t), . . . ,a r (t) = a r t + b r (t) where on are non-zero real numbers, bi(t) -< t, and 
a r+ i(t), . . . ,ae(t) -< t. 

We choose functions fi t N with ||/i,jv|| < 1 for i = 0,2,..., I, such that the value of the 
averages in (|52p is 1/N close to the supremum. Using the Cauchy-Schwarz inequality we see 
that (|52|) follows if we show that 



(54) limsupEi< n <Ar 



fo,N • r^/i • T^f 2 , N .... T^f e>N dfA <i, ai ,.. 

Notice that the limit in (|54p is equal to 

(55) A = limsup Ei<„<jv [ F 0tN ■ S^ n ^F 1 ■ S^ n ^F 2jN S^ n ^F i>N d{^ x fjt) 

iV-5>oo J 

where 

S = TxT, F 1 = /i®/ 1) and F i>N = f iiN ® J i>N for i = 0, 2, . . . , I. 
Using first the Cauchy Schwarz inequality, and then Lemma 14.6] we see that 

(56) |vl| 2 < 4 limsupE^^H^h 



i III 2V 



H— >oo 



where 
A h = limsupEi< n <Ar 



j g[ai{n+h)] p^ . g[a 2 (n+h)] p^ N . . g[M n + h )} p t 

Sl^WlFx ■ S [a2{n)] F 2:N • . . . • S Mn)] F l)N d{n x /i) 
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We factor out the transformation S^wl, For h G N fixed and large n G N, using that 
bi(n + h) — &i(n) — > for « = 1, . . . ,r (since -< t), and dj(n + h) — ctj(n) — > for 

i = r + 1, . . . ,£ (since ai(t) -< i), we get the identities 

[aj(n + h)] - [ai(n)] = [ctj(n) - oi(n)] + [o^/i] + ei(h,n) for i = 1, . . . , r, 

[oi(n + /i)] - [ai(n)] = [a«(n) - ai(rt)] + ei(h,n) for i = r + 1, . . . ,£, 

[di(n)] - [ai(n)] = [dj(n) - ai(rt)] + ei(h,n) for i = 2, . . . 

where the error terms ei(h,n) and ei(h,n) take values in the set {0, ±1,±2}. 
We find that 

A h = limsup Ei<„<jv I F 1An ■ S^~ a ^F 2An , N S^~ a ^F^ N d(ji x /i) 

where F 1An = T^^^Fi ■ F\, F iAn ^ N = ito^-MM) F l)N ■ T g ^ h ^F~^ for i = 2, . . . , r, 
and F iArhN = T e '^F^ N ■ T Si( - h ' n ^ F itN for % = r + 1, . . . ,t Next notice that for every fixed 
h G N we can partition the integers into a finite number of sets, that depend only on £, 
where all sequences ei(h,n),ei(h,n) are constant. It follows that for every h G N there exists 
i G {0,±1,±2} such that 



A h <Q limsup sup Ei< n <Ar 

N^oo ||J!li|| 00 ,...,||F < || 0Q <l 



where F^ h i = T^- aih ^ +l Fi ■ F\. Since the functions a 2 {t) — a\(t), . . . , a^(t) — a\(t) satisfy the 
assumptions of the induction hypothesis, it follows that 

( 57 ) A h <i,a 2 -ai,...,a e -a 1 i= ^f ±2 l-^Wlk^-l)- 

{\F\^h,i\k is constructed using the system {X x X, X x X, S, fj, x fi).) Since 

F x , h ,i = T^ +i F x ■ F\ = T^ +i (h ®h)-(h® h) = (T^+Vi • Ji) ® (rM4+i/i.7T), 
and (|2~T1) gives 

l/®7i* = i/iifi 

for every / G L°°(/x) and G N, we conclude that 

(58) \\Fi,hA\2 { l-l) = \\T [a ^h-Tltu-i- 
Combining ([5"7|) and ([58]) we get 

(59) limsupE 1 </i<H J A/ l < £)02 _ 01 0£ _ 01 max limsup Ei^hIT^+Vi • /llL-i- 

ff-*00 4 = 1, ..,5 fl-^oo 

Finally, one sees that 

(60) limsup Ei^HlT 1011 ^/! < ([<*i] + 1) limsup Ei< fc < ff |r h /i • AIm-i, 
and using Holder's inequality we get 

(61) limsupE 1 <^< i/ |r' l /l - AlL-i < limsup (Ei< h < H |T h /l • Alg^ 1 ) ^ = III/1III2V 
(The last equality follows from (|2"0]l .) Combining (JSgJ, ([59]), (JBUJ) , and flST]), we deduce that 

1^1 <f,ai,...,a £ Ill/ill^- 

This proves (|54l) and completes the induction step in the case where a\(t) ~ t. 
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Case 2. It remains to deal with the case ai(t) -< t. For i = 1,...,£, we write aj(i) = 
a,i(ai(t)) where di(t) = ai(aj~ 1 (t)). Notice that di(t) is an element of V since V is closed under 
composition and compositional inversion. Keeping in mind that a\{i) has maximal growth and 
using Lemma 2.7 and Proposition 2.11 in [7], we get for some e > that t e -< di(t) <C t for 
% = 1, . . . ,£ and t £ ~< di(t) — dj(t) <C i for i ^ j. Hence, the functions eii(i) = t, d2(£)> ■ • • > <^(i) 
satisfy the assumptions of the induction hypothesis. 

Using Lemma 2.7 in [7] we get for i = 2, . . . , £ that 

[ai(n)] = [di([ai(n)})} +ej(n) 

where the error terms ej(n) take values on a finite set. Therefore, the left hand side in (I52p is 
equal to 



lim sup 

Af->oo ||/o 



SUp Ei< n < N 

f2\\ oc ,--;\\ft\\ oc <l 



OQ ' II-* " ItQQ 



y o _ji[ai(n)] y r ^[a2([ai(n)])]+ e 2(n) j^. _ _ .J>[S^([ai (n)])]+e<> (n) ^ ^ 



Since the error terms ej(n) take values on a finite set, say with cardinality K (with K depending 

on the aj's only), one sees that the previous limit is bounded by K 1 times 

(62) 



lim sup sup Ei<„<tv 

[|/b[L,[|/ii[L,...,||/e|| ao <l 



f Q . j^M™)] fa . j#2([ai(n)])] J 2 . . . . . y[a<([*i («)])] J £ ^ 



Since i £ -< ai(i) -< i, we can apply Lemma [5TT1 for the sequence of real numbers (V]\r(n))N,n& 
defined by 

V N (n) = I / Q • T^^/i.JV • WD! / 2)i v • • • • • T^ ai WD! / £)iV d/i 



where the functions fijf are chosen so that the average in (|62H gets 1/N close to the supremum. 
We get that the limit in (j62H is equal to 



lim sup sup Ei< n <jy 

*->co |[/o|[ 00 ,|[/2lU,..,||M| 00 <i 



Since the functions t,a,2(t), . .. ,ag(t) satisfy the assumptions of the induction hypothesis, we 
are reduced to Case 1. This completes the induction step and the proof. □ 

5.2. The general case. We are going to use an inductive method analogous to the one used 
in Section T4.3I for polynomial families. First, following |7], we introduce a notion of complexity 
that is suitable for our current setup. 

We remind the reader that G = {a £ C(M+) : t k+£ -< a(t) -< t h+1 for some k > and e > 0}. 
We say that a family T = {ax(t), . . . agit)} of functions in CE is "nice" if di{t) £ Q and 
Oj — dj € Q for i ^ j. Given any such "nice" family we associate a vector (d,nd, ■ ■ • ,ni,no), 
called the type of J-, with non-negative integer entries, as follows: For every non-negative 
integer i let 

Ji = {a € F: f -< a(t) ~< t i+1 }. 

We say that two functions a,b & T\ are equivalent if a(t) — b(t) -< t l , and we define to be 
the number of non-equivalent elements of T%. We denote the maximum integer i for which 
ri{ ^ by d. We order the set of all possible types lexicographically, meaning, (d, Wd, ■ ■ ■ , W\) > 
(d' , w' d , . . . , w±) if and only if in the first instance where the two vectors disagree the coordinate 
of the first vector is greater than the coordinate of the second vector. 
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Example 2. If T = {t 1 / 3 , t 5 / 2 , i 5 / 2 + i 1 / 2 , t 5 / 2 + i 7 / 3 }, then the second and third functions are 
equivalent and all the other functions are non-equivalent. Hence, the type of J- is (2, 2, 0, 1). 

Given a "nice" family of functions T = {«i(t), . . . ai(t)}, a positive integer h £ N, and a £ J 7 , 
we form a new family J 7 (a(t), h) as follows: We start with the family of polynomials 

{a\{t + h) — a(i), . . . , ai(t + h) — a(t),ai(t) — a(t), . . . , a^(t) — a(t)}, 

and successively remove the smallest number of functions so that the remaining set consists of 
unbounded functions whose pairwise differences are also unbounded. Then for every large h 
the family J- (a, h) is also "nice" (if non-empty). The function ai{t + h) — a(t) will be removed 
if and only if aj(i) -< t, and the function aj(i) — a(t) will be removed if and only if a(t) = ai(t). 

Example 3. If T = {t 1 ^ 3 , i 1 / 2 , i 3 / 2 } and a(t) = i 1//3 , then we start with the family of polyno- 
mials 

{(£ + fc)V3 _ t l/3 > (t + ^1/2 _ t l/ 3j (t + h) 3/2 _ t l/3 )t l/3 _ ^3^1/2 _ ^1/3^3/2 _ f l/3 } 

and remove the first, second, and fourth functions to get 

T(t 1 / 3 , h) = {(t + hf' 2 - t 1 ' 3 , t 1 ' 2 - t 1 / 3 , t 3 ' 2 - t 1 ' 3 }. 

Notice that the family T has type (1, 1, 2), and the family J 7 ^ 1 / 3 , h) has smaller type, namely, 
(1,1,1)- 

To prove Theorem 12.91 we are going to use induction on the type of the family of functions 
involved. In order to carry out the inductive step we will use the following: 

Lemma 5.4. Let T = {ai(t), . . . ai(t)} be a "nice" family of functions. Suppose that ai(t) y t, 
and ai(t) has maximal growth rate within T , meaning a,i(t) <C oti(t) for i = 2, . . . , £. 

Then it is possible to choose a G T such that for every large h the family T{a{t), h) is "nice", 
has type smaller than that of J- , and the function a\(t + h) — a(t) has maximal growth rate 
within the family J 7 (a(t), h). 

Remark. Since a\(t) y t, no-matter what the choice of the function a(t) will be, the function 
ai(t + h) — a{t) is going to be an element of the family J 7 (o(t), h) for every large h. 

Proof. Suppose first that ai{t) -< a\(t) for some i 6 {2, ...,£}. Let i$ be such that the function 
ai (t) has the minimal growth (meaning aj (i) <C «i(t) for i = 1, ... ,£). Then a(t) = a,i Q (t) has 
the advertised property. 

Otherwise, aj(t) ~ ai(t) for i = 1, ...,£, in which case, for i = 2, . . . ,£, we have ai(t) = 
diai(t) + bi(t), for some non-zero real numbers ct2, ■ ■ ■ ,ot£, and functions bi(t) with b{(t) -< di(t). 
If c<i ^ 1 for some io G {2, . . . , £}, then a(t) = ai (t) has the advertised property. If a, = 1 for 
% = 2, . . . , £, let io be such that the function 6j (t) has maximal growth. Then a(t) = a« (t) has 
the advertised property. This completes the proof. □ 

We are now ready to give the proof of Theorem 12.91 We recall its statement for convenience. 

Theorem 5.5. Suppose that {ai(t), . . . , a#(t)} is a "nice" family of functions in ££. 
Then the factor Z is characteristic for the family {[ai(n)], . . . , [ct^(re)]}. 

Proof. Arguing as in the beginning of the proof of Proposition 14.71 we see that it suffices to 
show the following: If (X,X,fJL,T) is a measure preserving system, f\ G L°°(^) is orthogonal 
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to the nilfactor the family of functions T = {a\(t), . . . , ai(t)} is nice, and the function a\(t) 
has maximal growth within J 7 , then 



(63) lim 



sup Ei<„,<at 
II/oIU,||/ 2 || 00 ,...,||/WI 00 <1 



f Q . r [ai(n)]^ . T [a2(n)]j 2 . _ _ _ . j-K(n)]^ 



0. 



We are going to use induction on the type of the family {a\(t), . . . , ag(t)} to verify this state- 
ment. 

Proposition 15.31 shows that the result holds for d = 0. Suppose now that d > 1, and our 
statement holds for all families with type smaller than (d, n^, . . . , no). Let T = {a\(t), . . . , a^(t)} 
be a family with type (d, n^, ■ ■ ■ , no) (then a\{t) >~ t since a\{t) has maximal growth rate in J-). 

Choosing functions /^jv with H/^jvll < 1, so that the average in ([63]) is 1/N close to the 
supremum, and using the Cauchy-Schwarz inequality, we see that it suffices to show that 



lim Ei<„<Ar 

7V-s>oo ~ ~ 



0. 



fo,N ■ T^^/i ■ T^Wfr, N T^f ijN dii 
Equivalently, it suffices to show that 

lim Ei<„, <A r [ F 0N - 5 [ai(n)] Fi • S [a2< - n)] F 2 N -...- S Mn)] F e N d(u x u) = 
iV— »oo ~ ~ J ' 



where S = T x T, F\ = fx ® f 1 , and F^jy = fi,N &> /i at, for i = 0,2, ... ,£ Using the 
Cauchy-Schwarz inequality, and then Lemma [4.61 we reduce matters to showing for every large 
h that 



lim 

W-5-oo 



E 



n <7V J S^ {n+h ^F l ■ S^ n+h ^F 2 , N • . . . • S^ n+h ^F e>N - 

S [ai(n)]p i . 5 [aa(n)]^ ..... S Mn)] F^ N d{u X /i) 



0. 



Factoring out the transformation gK™)] where a(t) E {ai(i), . . . , ai(t)} is as in Lemma 15.41 we 
see that it suffices to show that for every large h we have 



lim Ei<„<Ar 

Af-s>oo _ _ 



g[ai(n+h)—a(n)]+e 1 (n)p^_g[a2(n+h)-a(n)]+e2{n)p 2 . (y{a e (n+h)-a(n)]+e e (n) p 



g[ai{n)-a(n)]+ei +1 (n) p^ . gla2{n)-a(n)]+e e+2 {n) p^ ^ g[a t {n)-a{n)]+e 2l (n) p ^ ^ ^/ x \ 







for some error terms ej(n) that take values in {0, 1}. Therefore, it suffices to show that for 
every large h we have 



(64) 



lim 

iV-s-oo 



sup Ei< n <iv / S'[ ai(n+ ' l )~ a(n )]Fi • 5'[ a2 ( n+/l )~ a ( n )]F 2 • . . . • sM n+h )~ a ^ n )1 Fg- 
llfblloc.-.llfSulloo^i ~ n ~ 

S [ ai (n)-a(n)] p i+i . S h(»)-«(»)] f|+2 ..... 5 K (n) -a(n)] ^ ^ x ^ =Q 



Next notice that if ai(t) -< t for some i £ {2, . . . , £}, then a,i(n + h) — aj(n) — >■ 0. Therefore, for 
our purposes we can practically assume that [oj(n + h) — a(n)] = [a,(n) — a(n)] for all n, and 
for those values of i we can write 

g[ai(n+h)—a(n)] p, . <j4<ii(ri)— a(n)]Ji, _ g[ai(n)—a(n)]\p\2 



After doing this reduction, for every large h the iterates that appear in the integral in (I64j) 
involve functions that belong to the family F{a{t), h) (defined at the beginning of the current 
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subsection). This family is "nice" and by Lemma [5.4l has type smaller than that of T . Further- 
more, one of these iterates appearing in this reduced form is S[ a H n +' l )~ a ( n )]_F 1 and the function 
a\(n + h) — a(n) has maximal growth in F(a(t), h) . Since f\ is orthogonal to the factor Z(T), 
we have that Fi = /i<8>/i is orthogonal to the factor Z(S). Therefore, the induction hypothesis 
applies, and gives that the limit in (j64j) is for every large h. This completes the induction 
and the proof. □ 



6. Proof of the convergence and the recurrence results 

In this section we combine Theorems 12.41 and 12.91 and the equidistribution results from |14j 
to prove the convergence and recurrence results stated in Sections 12.11 and 12.21 

6.1. Convergence results. We prove the convergence results stated in Sections 12.11 and 12.21 
To prove Theorem 12.11 we will need the following result: 

Theorem 6.1 (F. [2]). Suppose that the function a G T~L has polynomial growth and satisfies 
one of the three conditions stated in Theorem \2.1\ 

Then for every nilmanifold X = G/T, F G C(X), b G G, and x G X, the following limit 
exists lim^oo i En=i F(b^x). 

Proof of Theorem \2.l\ The necessity of the conditions was proved in [T2j by using examples of 
rational rotations on the circle. 

To show that the three stated conditions are sufficient for convergence we start by using 
Theorem 12.41 We get that the nilfactor Z is characteristic for the corresponding multiple 
ergodic averages. Using an ergodic decomposition argument and Theorem 13.31 we deduce that 
it suffices to prove convergence when our system is an inverse limit of nilsystems. A standard 
approximation argument allows us to finally reduce matters to nilsystems. 

Let (X = G/T, G/T, mx,Tb) be a nilsystem and F\,...,Ft G L°°(mx)- Our goal is to show 
that if the function a G H satisfies one of the three stated conditions, then the limit 

(65) lim Ek^nF^'^x) ■ F 2 (b 2[a{n)] x) F e (b £[a{n)] x) 

exists in L 2 (mx)- By density, we can assume that the functions F\,...,Fg are continuous. In 
this case we claim that the limit in (I65p exists for every Indeed, applying Theorem 16.11 

to the nilmanifold X k , the nilrotation b = (b, b 2 , . . . , b e ), the point x = (x, x, . . . , x), and the 
function F(x\, . . . , xg) = F\{x\) ■ ^2(^2) • • • Fg(xi), we get that the limit 

lim E l<n<N F(U a ^x) 

TV— >-oo _ _ 

exists. This implies that the limit in (|65p exists for every x £ X and completes the proof. □ 

To prove Theorem 12.21 we will need the following result: 

Theorem 6.2 (F. |14|). Let a G H have at most polynomial growth and satisfy \a(t) — cp(t)\ y 
logi for every c G M and p G Z[t]. 

Then for every nilmanifold X = G/T, b G G, and x G X, the sequence (b^ n ^x) n i=f,i is 
equidistributed in the nilmanifold (b n x) n£ ^. 

Proof of Theorem \2.2l Arguing as in the proof of Theorem 12.11 we reduce matters to showing 
the following: Let a £ fi satisfy the assumptions of our theorem, X = G /T be a nilsystem, 
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b £ G be a nilrotation, and F\, . . . , F$ £ C(X). Then for every x G X the limit in (|65p exists 
and is equal to the limit 

lim Ek^^^i) • F 2 (6 2 "x) • . . . • F,(6 fa x). 

Keeping the same notation as in the proof of Theorem 12,21 we rewrite the desired identity as 

lim E 1<n<N F(U a ^x) = lim E 1<n<N F(b n x). 

iV— >oo N—too 

Using Theorem 16. 2[ and the fact that the sequence (b x) n£ n is equidistributed in the set 
{b n x, n G N}, we deduce that the last identity holds for every x £ X , completing the proof. □ 

Next we prove Theorem 12.61 using the following result that will be established in Section [6.31 

Proposition 6.3. Suppose that the functions a±, . . . ,ai £ C£ n Q have different growth rates. 
Then for every nilmanifold X = G/T, b £ G ergodic, and F%, . . . , Fg £ C(X) we have 

lim E 1 < n<7V i 7, i(6 [ai(n)] x) • . . . • F e (b [ae{n)] x) = [ F 1 dm x [ F e dm x 

N^oo J J 

where the convergence takes place in L 2 (mx)- 

Proof of Theorem \2.6l Using Theorem 12.91 and arguing as in the proof of Theorem 12. li the result 
follows from Proposition 16.31 □ 

Lastly, we prove Theorem 12.71 Its proof has a rather soft touch of ergodic theory; in fact it 
is an easy consequence of the following general statement: 

Proposition 6.4. Let a±, . . . ,a£ £ C£ n Q have different growth rates and satisfy di(t) -< t 
for i = 1,...,£. Let (X,X,fi) be a probability space, and for i = 1,...,£ let (Fi(n)) n ^ 
be sequences of functions in L°°(n) with uniformly bounded norm such that the limits F{ = 
]ha N ^ M -^oo^M<n<N F i(n-) exist in L 2 (fi). 
Then 

lim E Kn aFi(K(n)]) • . . . • F e ([a e {n)}) =F 1 -...-F l 

iV-»oo _ _ 

where the limit is taken in L 2 (fi). 

Proof. As in the proof of Proposition 15.31 it will be convenient to work with the larger Hardy 
field V of Pfaffian functions which is closed under composition and compositional inversion. 
We use induction on I. For 1=1 the result follows from Lemma 15.11 

Suppose that the result holds for t — 1. Without loss of generality we can assume that 
di(t) ~< ai(t) for i = 1, . . . ,£ — 1. Assuming that Fg = it suffices to prove that 

(66) lim EKnaF^Kfn)]) • . . . • F e ([a e (n)\) = 

where the convergence takes place in L 2 (fi). For i = 1, . . . £ — 1 we let 5j(i) = aj(a^ 1 (t)) which 
is again an element of V . Since the functions a\(t), . . . , a^{t) have different growth rates, belong 
to 0, and ae(t) has maximal growth, the same is the case for the functions di(t), . . . , de-\(t),t, 
and di{t) -< t for i = 1, ...,£ — 1. Keeping this in mind and using Lemma 2.7 in [7] we get that 
t £ -< di(t) -< t for some e > 0. Furthermore, by Lemma 2.12 in [7] we have [aj(rt)] = aj([ag(n)]) 
for a set of n £ N of density 1. It follows from Lemma [5. II that in order to verify (|66p it suffices 
to show that 

(67) lim Ei<„<^i([Si(n)]) • . . • • F<_i([5<_i(n)]) • F e (n) = 0. 
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Since for i = 1, ...,£— 1 the functions di{t) belong to some Hardy field and satisfy ctj(i) -< t 
we have that di(t + 1) — di(t) —> 0. Using this, we see that there exists a sequence (irra)meN of 
non-overlapping intervals, with \I m \ — > oo, and such that the sequences [ai(n)], . . . , [a^_i(n)] 
are constant on every interval I m (for technical reasons we can also assume that \I m \ -< m). 
Then (1671) follows if we show that 



(68) lim E l < m < M (G m • E neIm F e (n)) = 0, 
where 

G m = F 1 ([5 1 (n m )]) • . . . • F e _i([d t -i(n m ))), 
and n m is any element of the interval I m . Furthermore, since the functions G m have uniformly 
bounded L°° norms, it suffices to show that 

(69) lim Ex< m <Af \\E n£lm F e (n)\\ L2( ) = 0. 
Our assumption gives that 

lim ||E„ e/m F £ (n)|| L2( } = 0, 
which immediately implies f)69[) . This completes the proof. □ 

We deduce Theorem 12.71 from Proposition 16.41 
Proof of Theorem \2. 1\ By the (uniform) mean ergodic theorem we have 

lim ||E M <n<iV?r./;|| i2W = E(fip(Ti)) 

where the convergence takes place in L 2 (/i). We can therefore apply Proposition 16.41 for the 
sequences (Fi(n)) n ^, i = defined by Fi(n) = Tffi to conclude the proof. □ 

6.2. Recurrence results. We prove Theorem 12.31 using the following multiple recurrence re- 
sult that will be handled later. 

Proposition 6.5. Let a & % be of the form a(t) = p(t)a + e(t) for some p E Z[t], a E R, 
and 1 -< e(t) -< t. Let (X,X,p,,T) be a system and f E L°°(/u) be non-negative and not almost 
everywhere zero. 

Then for every I E N we have 

(70) lim sup E M <n<iV / / • T^f • . . . • T e ^f dfJi > 0. 
N-M^oo J 

Proof of Theorem \2.3l If \a(t) — cp(t)\ y logi for every c E R and p E Z[i], then the result 
follows immediately by combining Theorem 12.21 and Furstenberg's multiple recurrence theorem 
([18J). Furthermore, the case where a(t) = cp(t)+e(t) for some c E R, p E Z[i], and e(t) <C \ogt, 
is taken care by Proposition 16.51 This completes the proof. □ 

Lastly, we deduce Theorem 12.81 from Theorem 12.61 

Proof of Theorem \2.8l Let fj, = J ntdX(t) be the ergodic decomposition of the measure [i. Using 
Theorem 12.61 for the ergodic systems (X, X,fit,T) we get that 



lim Ei <n<N 

N-too ~ 



n(A n r^^Un r^^U n • • • n r^WU) = J (jn(A)) t+1 d\(t). 



Using Holder's inequality we see that the last integral is at least 

MA) dX(t)y +1 = (KA)) e+1 - 
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This proves the advertised estimate and completes the proof. □ 

6.3. Proof of Proposition [5751 In the case where all the functions a\(t), . . . , ae(t) have super- 
linear growth Proposition 16.31 is a direct consequence of the corresponding pointwise result in 
|14| (Theorem 1.3). The general case can be covered using a modification of an argument used 
in |14j . To avoid unnecessary repetition our proof will be rather sketchy. 
We are going to use the following: 

Proposition 6.6. Suppose that the functions a\, . . . ,ag G C£ n Q have different growth rates. 
Then for every nilmanifold X = G/T, b G G, x G X , and F G C(X^), we have 



lim lim Ei<„<tv 

R— >oo s-oo 



El<r<fli ? (^ [ai(i?n+r)] X, . . . , b MRn+r)] x) 







where Xt, = {b n x: n G N}. 



Sketch of Proof. Using a straightforward modification of the reduction argument of Section 
5.2 in [14], we can reduce matters to showing that for every nilmanifold X = G/T, with G 
connected and simply connected, b G G ergodic, and F G C(X i ), we have 



lim lim Ei< ra <jv 
R— >oo TV— >oo ~ ~ 



Ei< r < R F{b^ Rn+r ^x, • • ■ M ai{Rn+r)] x) ~ j F dm xt 



0. 



This was verified while proving Proposition 5.3 in |14j . completing the proof. □ 

Proof of Proposition \6.<ft For convenience of exposition we are going to assume that all the 
limits mentioned below exist; their existence will follow once our argument is completed. 

Our goal is to show that for every nilmanifold X = G/T, b G G ergodic, and Fi, . . . ,Fg G 
C(X) we have 

lim Ei<„<ArFi(6 [ai( " )] x) • . . . • F e (b [ae( - n)] x) = [ Fi dm x ■ ■■■■ I F e dm x 
N-^oo J J 

where the convergence takes place in L 2 (mx)- 

If all the functions a\{t),. . . ,ag{t) are sub-linear, then the result follows from Theorem 7.3 
in [7j, and if all the functions are super-linear, then the result follows from Theorem 1.3 in [1 i|. 
If none of these is the case, we can assume that a\(t), . . . , a m (t) >- t and a m+ i{t), . . . , ai{t) -< t 
for some m G {1, ... ,1— 1}. 

Let F G C{X m ) and G G C(X e - m ). It suffices to show that 
(71) 



lim Ei< n < 

N->oo 



^(F(6[ ai ( n )^,...,fe[ a ™W]x)-G'(6[ am + 1 ^x,...,6 [a ^ n ) 1 x^ = J Fdrnxm J Gdm x i- m 



where the convergence takes place in L 2 (mx). For every R G N the limit in (I7ip is equal to 
(72) 

lim E 1<n<N (E 1<r<R (F(b^ nR+r ^x, b^ nR+r ^x) ■ G{b^ m+l{ - nR+r ^x, ftM^+OL 

Since the functions a m +i, . . . , ai G ££ are all sub-linear, we can show (see Lemma 2.12 in [7]) 
the following: for every R G N, for a set of n G N of density 1, we have [aj(ni? + r)] = [ai(nR)\ 
for r = 1, . . . , R and i = m + 1, . . . , i. We deduce that the limit in (|7"2l) is equal to 

(73) lim Ei< n <jv ( G(b^ m + l( - nR ^x, b^ nR ^x) ■ E 1<r<R F(U ai( - nR+r ^x, bM nR + r ^x)) . 
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Using Proposition 16.61 we see that the limit of the expression (|73p as R — > oo is equal to 



lim lim K 1<n<N G(b^ am+l( - nR ^x, . . .,b MnR)] x) ■ [ F dm x ™. 

R-^oo N— >oo J 

Since b G G is ergodic and the functions a m+ \, . . . ,ae G ££ D Q are all sub-linear and have 
different growth, by Proposition 6.4 in [7] we have (for every R G N) that 

lim E 1<n<N G(U am+l( - nR ^x, . . . ,b^ nR ^x) = [ G dm x e- m 

N^oo ~ ~ J 

where the convergence takes place in L 2 (/i). Combining all the previous facts we get (|7ip . This 
completes the proof. □ 



6.4. Proof of Proposition 16.51 First, we informally discuss the proof strategy of Proposi- 
tion 16.51 When the function a(t) is logarithmically close to a constant multiple of an integer 
polynomial the relevant multiple ergodic averages cannot be directly compared with Fursten- 
berg's averages (because of the luck of equidistribution) . To bypass this difficulty we work 
with an appropriate subsequence of the sequence [a(n)]. This subsequence is chosen so that it 
becomes possible to compare the corresponding multiple ergodic averages with those along the 
sequence [na]. For the latter averages positiveness follows easily from Furstenberg's multiple 
recurrence theorem, thus achieving our goal. 

To carry out this plan we need a few preliminary results that enable us to carry out the 
aforementioned "comparison step". 

We start with an equidistribution result on nilmanifolds. To prove the claimed uniformity 
we make use some quantitative equidistribution results (this is the only place in the present 
article where we make explicit use of such results). 

Lemma 6.7. Let X = G/T be a connected nilmanifold, b £ G be an ergodic nilrotation, and 
p G Z[i] be non-constant. 

Then for every F £ C(X) we have 



(74) 



lim max 



E 



M<n<N 



F(V { 



F dmx 



0. 



Proof. We argue by contradiction. Suppose that (I74p fails for some connected nilmanifold X, 
ergodic b G G, p G Z[i] with k = deg(p) > 1, and F G C{X). Then there exist 5 > 0, x m G X, 
and sequences of positive integers (n m ) mg N, (N m ) me fq, with N m — > oo, such that the sequence 
Xm)i< n <N m is not 5-equidistributed in X for every m G N. 
Then by Theorem 13.21 (suppose that 5 is small enough so that the theorem applies) there 
exists a constant M 
and such that 

(75) 



M(X,5,k), and a sequence of quasi-characters tfj m , with 



< M, 



+n), 



C°°[7V„ 



< M 



for every m G N. As explained in Section I3.2| the affine torus A of X can be identified with 
a finite dimensional torus T'. After making this identification, we have ip m (t) = K m ■ t for 
some non-zero K m G Z J , and the nilrotation b induces a d-step unipotent affine transformation 
Tfc : T' — > T l . Let %(b) = (/?iZ, . . . , /3 S Z), where fa G R, be the projection of b to the Kronecker 
factor of Tf, (notice that s is bounded by the dimension of X). Since b acts ergodically on X 
the set B = {1, fa, ... , fa} is rationally independent. For every x G T the coordinates of T^x 
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are polynomials of n, and so K m ■ T^x is a polynomial of n. Moreover, the leading term of the 
polynomial K m ■ T^x has the form 7 m n d , where 1 < d < d, and 

1 8 

(76) 7m = tt / r m i/3i, where r 8 G Z are not all zero and \r m A < c\ ■ M 

k\ z — ' 

i=i 

for some constant c\ that depends only on 60 Using this and the definition of IHIoojat] (see 
([19])), it follows that 



HT P(n m +n) Xm) 



Combining this with (|75p we get that 
(77) ||7m|| < 



Since by (|76|) we have a finite number of options for the irrational numbers 7 m , and N m — > oo, 
letting m — > oo we get a contradiction. This completes the proof. □ 

Next we use the Lemma 16.71 to establish a key identity. We remark that if G is a connected 
and simply connected nilpotent Lie group, then there exists a unique continuous homomorphism 
6: R — » G with 6(1) = 6. For be G and t E R, by 6* we mean the element b{t) E G. 

Lemma 6.8. Let X = G/T be a nilmanifold with G connected and simply connected, a G R 
be non-zero, and b £ G be such that the sub-nilmanifold {{naL,b na Y),n £ N} of T x X is 
connected. 

Then for every F E C(X), F0lner sequence (& m )m£N in Z, and non-constant p E Z[t] we 

(78) lim E ne ^ m F(b^ a+m ^T) = lim Ek^F^L). 

Proof. We first do some maneuvers that enable us to remove the integer parts and bring us to 
a point where Lemma 16.71 is applicable. We let X be the nilmanifold T x X which we identify 
with (R x G)/(Z x T), set 6 = (a, b a ), and define the function F on X by 

F(tZ,gT)=F(b-^gT). 

Notice that for every t E R we have 

(79) F(6*f) = F(b- {at} b at T) = F(b^F) 

where r = Z x T. (We caution the reader that the function F is not continuous on X.) 

By assumption the nilmanifold X = {6 ra r, n E N} is connected. Since the nilrotation 6 acts 
ergodically on the connected nilmanifold X, part of the hypothesis of Lemma 16.71 is satisfied. 

Next we claim that Lemma 16.71 can be applied for the restriction of the function F to the 
nilmanifold X, namely we claim that 



(80) lim max E M <n<NF{lP {n) x) -IF dm* 

TV-M-kx) i& x J 



0. 



Suppose first that a is rational. Since X is a connected nilmanifold and r E X, we have 
that X C {Z} x X. Hence, the restriction of F onto X is given by F(Z,gT) = F(gT) (for 



n For example, if T: T 2 -> T 2 is defined by T(x,y) = (x + a, y + 2x + a) and k = (fci,fc 2 ), then k-T n (x,y) = 
fecm 2 + (kia + 2k2x)n. Therefore, the corresponding leading term is either kiar? (if &2 7^ 0), or kian (if ki — 0). 
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those (Z, gT) that belong to X), and as a result is continuous. In this case, (|8(J|) is a direct 
consequence of Lemma 16.71 Therefore, it remains to verify (|80p when a is irrational. Notice 
first that the set of discontinuities of F on X is a subset of the nilmanifold {Z} x X. Near a 
point (Z, ^r) of {Z} x X the function F comes close to the value F(gT) or the value F(& _1 <7r). 
For 5 > (and smaller than 1/2) let =JjxI where 1,5 = {iZ: ||t|| > <5}. There exist 
functions Fg G C(X) that agree with F on and have sup- norm bounded by 2 HF^. Since a 
is irrational, the sequence (p(n)aZ) n6 N (which happens to be the first coordinate of (i) p ^)n£'N) 
is well distributed in T, and result 

lim max \E M<n<N li$ ig]({x + p(n)a}) - (1 - 25) \ = 0. 

N—M-+OQ xe[0,l] 

It follows that 

(81) limsup maxE M < n < N \F(&Wx) - F s {^ n) x)\ < 3 {{F^ 5. 

AT— M-»oo xeX 

By Lemma 16.71 equation (|80p holds if one uses the functions Fg in place of the function F. 
Using this and (|8ip . we deduce (|80p. This proves our claim. 
Next notice that (|8U|) gives that 

(82) lim E ne$m F(^") +m f ) = IP d mjc . 

Furthermore, since the nilrotation b acts ergodically in X, the last integral is equal to 

(83) lim E 1<n<iV F(6 n f). 

N— >oo ~ ~ 

(If F is not continuous we argue as before to get this.) Using (|79p we see that 

(84) F (jp{n)+mf} = F ( b lp(n)*+ ma ] T ^ and F(b n f) = F(b [na] T). 

Combining {82]), ®, and ® , we get ([111). This completes the proof. □ 

The next lemma will be used later to verify that certain connectedness assumptions (needed 
to apply Lemma I6.8P are satisfied. 



Lemma 6.9. Let X = G/T be a connected nilmanifold and b E G be an ergodic nilrotation. 

Then there exists a connected sub-nilmanifold Z of X e such that for a.e. g G G the element 
b g = (g _1 bg, g~ l b 2 g, . . . ,g _1 b e g) acts ergodically on Z. 

Remark. The independence of Z of the generic g G G will not be used, only that Z is connected 
will be used. 

Proof. This is an immediate consequence of a limit formula that appears in Theorem 2.2 of 
[37] (the details of the deduction appear in Corollary 2.10 of [33]). □ 



As mentioned before, our plan is to establish a positiveness property by comparing certain 
multiple ergodic averages to some simpler ones involving the sequence [net]. The next lemma 
establishes the positiveness property needed for the latter averages. 

Lemma 6.10. Let (X, X , u,T) be a system, I G N, and a be a non-zero real number. 
Then for every f G L°° (//) positive and not almost everywhere zero we have 

(85) liminf Ei<„<tv / / ■ T H / • . . . • T e ^f du > 0. 

N^roo J 
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Proof. We follow closely an argument used in [6] (Theorem 2.3). 

For a general sequence (a(n)) ne pj of non-negative numbers, and m G N, one has 

liminf Ei< n <Ara(n) > — lim infEi< n < Naimn). 

N->oo ~ ~ m N->oc ~ ~ 

Using this for m = 1, . . . ,iVo, where iVo is an integer that will be chosen later, and averaging 
over m we get 

liminf E 1<n<N a(n) > Ei <m< No ( —limmi~Ei <n< N a(mn) 

The last expression is greater or equal than 

— lim inf Ei< n < N (Ei< m < N() a(mn)) . 
1\q N—>co 

As a result, if there exists N$ G N such that 

^i< m <N a(mn) > c> 
for a set S of n G N of positive lower density, then 

liminf Ei< n <jvo(n) > 0. 

N— >oo 

a(n) = f f ■ T [na] f ■■■■■ T^f dpi. 
to prove ([85]) . 

We choose iVo as follows: By the uniform multiple recurrence property (|8j), there exists 
Nq G N, and positive constant c (depending only on f fd/j, and £), such that for every r G N 
one has 



We shall use this for 



Ei< m <iv / / • T rm f ..... T lrm f dfi > c. 



We choose S = {n: {na} < l/A'o}, which as is well known has positive density. Since 
[m/3] = m[f3] whenever {/?} < 1/m, we have 

a{mn) = J f- T m ^f T £m ^f du 

for every n G S and m = 1, . . . , iVo- As a result, for every n G S we have 

Ei< m < JVo a(mn) = Ei< m a / / ■ T m ^ f T £m ^ f da > c 

where the last estimate follows form (I86D applied for r = [na]. As explained before this implies 
(j85j) and completes the proof. □ 



We now have all the ingredients needed to prove Proposition 16.51 

Proo f o f Proposition 1 6. 5t If a = 0, or p is constant, we have 1 -< a(t) -< t. Since a G H, it 
follows that for every large m the sets {n G N: [a(n)] = m} are intervals with length increasing 
to infinity as m — > oo. Using this, the result follows from Furstenberg's multiple recurrence 
theorem ([18]). 

Suppose now that q / and p is non-constant. We start with some reductions. We can 
assume that p(0) = 0. Furthermore, using an ergodic decomposition argument we can assume 
that the system is ergodic. By Theorem 12.41 we can reduce matters to showing (170|) in the case 
where the system is an inverse limit of nilsystems. Lastly, an argument completely analogous 
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to the one used in the proof of Lemma 3.2 in [20] shows that the positiveness property (|70p is 
preserved by inverse limits. Hence, we can assume that the system is a nilsystem. 

Suppose now that X = G/T is a nilmanifold, b £ G, and F G L°°(mx) is non-negative 
and not almost everywhere zero. Our goal is to show that there exists a sequence of intervals 
(-fm)meN with length converging to infinity such that 

(87) lim E neIm V([a(n)]) > 

where 

' iirc„ 



V(n) = J F(x) ■ F(b n x) • . . . • F(b m x) dm X - 

The choice of the intervals (/ m ) m eN will be made so that the values of the sequence [a(^)] for 
n G I m take a convenient form. More precisely, since 1 -< e(t) -< t and e G T~L, for every r G N 
(to be chosen later), there exists a sequence of intervals (J r ,m)m£N with \ J r ,m\ — > oo, such that 

mra < e(n) for n G J riJn and sup (e(n) — mra) — > 0. 

For 5 > (to be chosen later) define 

S r>s = {n € N: {p(n)ra} < 1 - 6}. 
Then for every large m, if n G J rjm fl we have 

[a(n)] = \p(n)a + mra]. 

It follows that in order to verify (j87|) it suffices to show that for some r G N and 5 > we have 

(89) limsupE neJr m (F([p(n)a + mra}) ■ l SrJ (n)) > 0. 

In fact it suffices to show that there exists r G N such that 

(90) lim sup E nE j r>m V(\p(n)a + mra]) > 0, 

m— >oo 

the reason being that for every r G N, a G R, and 5 small, we have 

nmsupE neJr (1 - l Sr ») < 6,E 



and as a result (J90J) readily implies (189|) . again if <5 is small. Therefore, we can concentrate our 
attention in proving (f9Q|) . 

We shall show that there exists an r G N such that 

(91) lim E ne ^ m V (\p(rn)a + mra]) = lim Ei <n<N V([nra\) 

for every F0mer sequence ($ m ) mg N in Z. Having shown this, we can finish the proof by using 
the definition of V(n) and Lemma 16.101 We deduce that the limits in ()9ip are positive, and as 
a result fl90|) holds. 

We proceed now to find this r G N. Using a standard lifting argument we can also assume 
that the group G is connected and simply connectedEl A standard approximation argument 
shows that it suffices to verify (|91j) whenever the function F in (I88p is continuous. Our plan is 



12 If a is irrational the estimate holds with equality (this follows from equidistribution) . If a is rational the 
estimate holds trivially for 8 small. 

^We can assume that G/Go is finitely generated. In this case, one can show ([28]) that there exists a 
nilmanifold X — G/T, where G is a connected and simply-connected Lie group, such that for every F £ C(X), 
be G, and x £ X, there exists F G C(X), b € G, and x e X, such that F(b n x) = F(b"x) for every n e N. 
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to use Lemma 16, 81 to establish a stronger pointwise result, namely, there exists an r G N such 

that for a.e. x G X we have for every Folner sequence (3>rn)meN in 2 that 

(92) 

lim E ne $ m F(b^ rn ^ a+mr ^x)-. . .-Fib^^+^x) = lim E 1<n<N F(b^x)-. . .-F(b e[nra] x). 

m— »oo m— >oo ~ ~ 

(Our argument will show that both limits exist.) Trivially, this pointwise result implies (|9ip . 
Therefore, we are left with establishing ([92]). 

We do some preparation in order to use Lemma 16.81 For every n G N and element x = gT 
of X we have 

(93) F{b n x)- .... F{b in x) = F{b n g t), 
where F(xi, . . . , x t ) = F{g Xl ) ■ F(gx e ) (g C(X e )), f = and 

b g = (g- 1 bg,g- 1 b 2 g,...,g- 1 b e g)£G e . 
Next we show that there exists an r £ N such that for a.e. g £ G the sub-nilmanifold 

(94) {(nra,b™ ra f),n G N} 

of T x is connected. Indeed, as mentioned in Section 13.21 there exists ro such that the 
nilmanifold {(raroaZ, 5 nr °°T), n G N} is connected. It follows form Lemma 16.91 (applied to the 
nilmanifold T x X and the element (roa, b roa ) G T x G) that for almost every g G G the 
sub-nilmanifold 

{(nr aZ, . . . , ^nr aZ, ^ roQ f), n G N} 

of x X e is connected. Projecting in the appropriate coordinates we get that the nilmanifold 
in (JM]) is also connected. 

We are now in a position where we can apply Lemma [6. 81 for the nilmanifold X , the function 
F G C(X ), the integer ro we just found, the element roa £ 1 in place of a, the polynomial 
q G Z[i] defined by g(t) = p(rot)/ro (remember ^(0) = 0), and the elements 6 9 G G l (for those 
<7 G G for which the set (f94"|) is connected for r = r ). We deduce that for almost every g G G, 
and for every Folner sequence (<& m ) m eN in ^> we have that 

lim E ne$m F(&[f( ron ) Q+ ™]f) = lim E 1<n<N F{b^ r °^T ). 
Using (j93l) . this gives (|92]l . which in turn gives ([91~|) . This completes the proof. □ 
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